Relevant  and  Substructural  Logics 

Greg  Restall* 

Philosophy  Department,  Macquarie  University 

Greg . Restall@mq . edu . au 
June  23,  2001 

http: //www.phil .mq. edu. au/staf f /grestall/ 


Abstract  This  is  a  history  of  relevant  and  substructural  logics,  written  for  the  Hand- 
book of  the  History  and  Philosophy  of  Logic,  edited  by  Dov  Gabbay  and  John  Woods. 1 

1  Introduction 

Logics  tend  to  be  viewed  of  in  one  of  two  ways  —  with  an  eye  to  proofs,  or 
with  an  eye  to  models.2  Relevant  and  substructural  logics  are  no  different: 
you  can  focus  on  notions  of  proof,  inference  rules  and  structural  features  of 
deduction  in  these  logics,  or  you  can  focus  on  interpretations  of  the  language 
in  other  structures. 

This  essay  is  structured  around  the  bifurcation  between  proofs  and  mod- 
els: The  first  section  discusses  Proof  Theory  of  relevant  and  substructural  log- 
ics, and  the  second  covers  the  Model  Theory  of  these  logics.  This  order  is  a 
natural  one  for  a  history  of  relevant  and  substructural  logics,  because  much 
of  the  initial  work  —  especially  in  the  Anderson-Belnap  tradition  of  relevant 
logics  —  started  by  developing  proof  theory.  The  model  theory  of  relevant 
logic  came  some  time  later.  As  we  will  see,  Dunn's  algebraic  models  [76,  77] 
Urquhart's  operational  semantics  [267,  268]  and  Routley  and  Meyer's  rela- 
tional semantics  [239,  240,  241]  arrived  decades  after  the  initial  burst  of  ac- 
tivity from  Alan  Anderson  and  Nuel  Belnap.  The  same  goes  for  work  on  the 
Lambek  calculus:  although  inspired  by  a  very  particular  application  in  lin- 
guistic typing,  it  was  developed  first  proof-theoretically,  and  only  later  did 
model  theory  come  to  the  fore.  Girard's  linear  logic  is  a  different  story:  it 
was  discovered  though  considerations  of  the  categorical  models  of  coherence 
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gram. Thanks,  too,  go  to  Nuel  Belnap,  Mike  Dunn,  Bob  Meyer,  Graham  Priest,  Stephen  Read  and 
John  Slaney  for  many  enjoyable  conversations  on  these  topics. 

({This  is  a  draft  and  it  is  not  for  citation  without  permission.  Some  features  are  due  for  severe 
revision  before  publication.  Please  contact  me  if  you  wish  to  quote  this  version.  I  expect  to  have 
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before  emailing  me  with  a  list  of  errors.  But  once  you've  done  that,  by  all  means,  fire  away!}} 

lThe  tide,  Relevant  and  Substructural  Logics  is  not  to  be  read  in  the  same  vein  as  "apples  and 
oranges"  or  'Australia  and  New  Zealand."  It  is  more  in  the  vein  of  "apples  and  fruit"  or  "Australia 
and  the  Pacific  Rim."  It  is  a  history  of  substructural  logics  with  a  particular  attention  to  relevant 
logics,  or  dually,  a  history  of  relevant  logics,  playing  particular  attention  to  their  presence  in  the 
larger  class  of  substructural  logics. 

2Sometimes  you  see  this  described  as  the  distinction  between  an  emphasis  on  syntax  or  se- 
mantics. But  this  is  to  cut  against  the  grain.  On  the  face  of  it,  rules  of  proof  have  as  much  to 
do  with  the  meaning  of  connectives  as  do  model-theoretic  conditions.  The  rules  interpreting  a 
formal  language  in  a  model  pay  just  as  much  attention  to  syntax  as  does  any  proof  theory. 
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spaces.  However,  as  linear  logic  appears  on  the  scene  much  later  than  rele- 
vant logic  or  the  Lambek  calculus,  starting  with  proof  theory  does  not  result 
in  too  much  temporal  reversal. 

I  will  end  with  one  smaller  section  Loose  Ends,  sketching  avenues  for  fur- 
ther work.  The  major  sections,  then,  are  structured  thematically,  and  inside 
these  sections  I  will  endeavour  to  sketch  the  core  historical  lines  of  develop- 
ment in  substructural  logics.  This,  then,  will  be  a  conceptual  history,  indicat- 
ing the  linkages,  dependencies  and  development  of  the  content  itself.  I  will 
be  less  concerned  with  identifying  who  did  what  and  when.3 

I  take  it  that  logic  is  best  learned  by  doing  it,  and  so,  I  have  taken  the  lib- 
erty to  sketch  the  proofs  of  major  results  when  the  techniques  used  in  the 
proofs  us  something  distinctive  about  the  field.  The  proofs  can  be  skipped  or 
skimmed  without  any  threat  to  the  continuity  of  the  story.  However,  to  get  the 
full  flavour  of  the  history,  you  should  attempt  to  savour  the  proofs  at  leisure. 

Let  me  end  this  introduction  by  situating  this  essay  in  its  larger  context 
and  explaining  how  it  differs  from  other  similar  introductory  books  and  es- 
says. Other  comprehensive  introductions  such  as  Dunn's  "Relevance  Logic 
and  Entailment"  [81]  and  its  descendant  "Relevance  Logic"  [94],  Read's  Rel- 
evant Logic  [224]  and  Troelstra's  Lectures  on  Linear  Logic  [264]  are  more  nar- 
rowly focussed  than  this  essay,  concentrating  on  one  or  other  of  the  many 
relevant  and  substructural  logics.  The  Anderson-Belnap  two-volume  Entail- 
ment [10,  11]  is  a  goldmine  of  historical  detail  in  the  tradition  of  relevance 
logic,  but  it  contains  little  about  other  important  traditions  in  substructural 
logics. 

My  Introduction  to  Substructural  Logics  [234]  has  a  similar  scope  to  this 
chapter,  in  that  it  covers  the  broad  sweep  of  substructural  logics:  however, 
that  book  is  more  technical  than  this  essay,  as  it  features  many  formal  re- 
sults stated  and  proved  in  generality.  It  is  also  written  to  introduce  the  subject 
purely  thematically  instead  of  historically. 

2  Proofs 

The  discipline  of  relevant  logic  grew  out  of  an  attempt  to  understand  notions 
of  consequence  and  conditionality  where  the  conclusion  of  a  valid  argument 
is  relevant  to  the  premises,  and  where  the  consequent  of  a  true  conditional  is 
relevant  to  the  antecedent. 

"Substructural"  is  a  newer  term,  due  to  Schroder- Heister  and  Dosen.  They 
write: 

Our  proposal  is  to  call  logics  that  can  be  obtained ...  by  restricting 
structural  rules,  substructural  logics.  [250,  page  6] 

The  structural  rules  mentioned  here  dictate  admissible  forms  of  transforma- 
tions of  premises  in  proofs.  Later  in  this  section,  we  will  see  how  relevant 
logics  are  naturally  counted  as  substructural  logics,  as  certain  commonly  ad- 
mitted structural  rules  are  to  blame  for  introducing  irrelevant  consequences 
into  proofs. 

3  In  particular,  I  will  say  little  about  the  intellectual  ancestry  of  different  results.  I  will  not  trace 
the  degree  to  which  researchers  in  one  tradition  were  influenced  by  those  in  another. 
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Historical  priority  in  the  field  belongs  to  the  tradition  of  relevant  logic,  and 
it  is  to  the  early  stirrings  of  considerations  of  relevance  that  we  will  turn. 

2.1    Relevant  Implication:  Orlov,  Moh  and  Church 

Dosen  has  shown  us  [71]  that  substructural  logic  dates  back  at  least  to  1928 
with  I.  E.  Orlov's  axiomatisation  of  a  propositional  logic  weaker  than  classi- 
cal logic  [207]  .4  Orlov  axiomatised  this  logic  in  order  to  "represent  relevance 
between  propositions  in  symbolic  form"  [71,  page  341].  Orlov's  propositional 
logic  has  this  axiomatisation.5 


o 

double  negation  introduction 

<> 

~~A  — >  A 

double  negation  elimination 

<> 

A  ->  ~(A  ->  ~A) 

contra  posed  reductio 

o 

(A^B)^(^B^ 

~A) 

contraposition 

o 

(A->  (B  ->  C))  -> 

(B-» 

(A- 

*c)) 

permutation 

o 

(A^B)^((C^ 

A)- 

>  (C- 

->B)) 

prefixing 

o 

A,  A  — >  B  ^  B 

modus  ponens 

The  axioms  and  rule  here  form  a  traditional  Hilbert  system.  The  rule  modus 
ponens  is  written  in  the  form  using  a  turnstile  to  echo  the  general  definition 
of  logical  consequence  in  a  Hilbert  system.  Given  a  set  X  of  formulas,  and  a 
single  formula  A,  we  say  that  A  can  be  proved  from  X  (which  I  write  "X  =#>  A") 
if  and  only  if  there  is  a  proof  in  the  Hilbert  system  with  A  as  the  conclusion, 
and  with  hypotheses  from  among  the  set  X.  A  proof  from  hypotheses  is  sim- 
ply a  list  of  formulas,  each  of  which  is  either  an  hypothesis,  an  axiom,  or  one 
which  follows  from  earlier  formulas  in  the  list  by  means  of  a  rule.  In  Orlov's 
system,  the  only  rule  is  modus  ponens.  We  will  see  later  that  this  is  not  nec- 
essarily the  most  useful  notion  of  logical  consequence  applicable  to  relevant 
and  substructural  logics.  In  particular,  more  interesting  results  can  be  proven 
with  consequence  relations  which  do  not  merely  relate  sets  of  formulas  as 
premises  to  a  conclusion,  but  rather  relate  lists,  or  other  forms  of  structured 
collections  as  premises,  to  a  conclusion.  This  is  because  lists  or  other  struc- 
tures can  distinguish  the  order  or  quantity  of  individual  premises,  while  sets 
cannot.  However,  this  is  all  that  can  simply  be  done  to  define  consequence  re- 
lations within  the  confines  of  a  Hilbert  system,  so  here  is  where  our  definition 
of  consequence  will  start. 

These  axioms  and  the  rule  do  not  explicitly  represent  any  notion  of  rele- 
vance. Instead,  we  have  an  axiomatic  system  governing  the  behaviour  of  im- 
plication and  negation.  The  system  tells  us  about  relevance  in  virtue  of  what 

4Allen  Hazen  has  shown  that  in  Russell's  1906  paper  "The  Theory  of  Implication"  his  propo- 
sitional logic  (without  negation)  is  free  of  the  structural  rule  of  contraction  [133,  243].  Only  after 
negation  is  introduced  can  contraction  can  be  proved.  However,  there  seems  to  be  no  real  sense 
in  which  Russell  could  be  pressed  in  to  favour  as  a  proponent  of  substructural  logics,  as  his  aim 
was  not  to  do  without  contraction,  but  to  give  an  axiomatic  account  of  material  implication. 

5The  names  are  mine,  and  not  Orlov's.  I  have  attempted  to  give  each  axiom  or  rule  its  com- 
mon name  (see  for  example  Anderson  and  Belnap's  Entailment  [10]  for  a  list  of  axioms  and  their 
names).  In  this  case,  "contraposed  reductio"  is  my  name,  as  the  axiom  A  — >  ~(A  — »  ~A)  is  a 
rarely  seen  axiom,  but  it  is  a  contraposed  form  of  (A  — »  ~A)  — »  ~A,  which  is  commonly  known 
as  reductio. 
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it  leaves  out,  rather  than  what  it  includes.  Neither  of  the  following  formulas 
are  provable  in  Orlov's  system: 

A^(B^B)  -(B^B)^A 

This  distinguishes  his  logic  from  both  classical  and  intuitionistic  proposi- 
tional  logic.6  If  the  "— >"  is  read  as  either  the  material  conditional  or  the  con- 
ditional of  intuitionistic  logic,  those  formulas  are  provable.  However,  both  of 
these  formulas  commit  an  obvious  failure  of  relevance.  The  consequent  of  the 
main  conditional  need  not  have  anything  to  do  with  the  antecedent.  If  when 
we  say  "if  A  then  B"  we  mean  that  B  follows  from  A,  then  it  seems  that  we 
have  lied  when  we  say  that  "if  A  then  B  — >  B ",  for  B  — >  B  (though  true  enough) 
need  not  follow  from  A,  if  A  has  nothing  to  do  with  B  — >  B.  Similarly,  A  need 
not  follow  from-~(B  — >  B)  (though  ~(B  — >  B)  is  false  enough)  for  again,  A  need 
not  have  anything  to  do  with  ~(B  — >  B).  If  "following  from"  is  to  respect  these 
intuitions,  we  need  look  further  afield  than  classical  or  intuitionistic  propo- 
sitional  logic,  for  these  logics  contain  those  formulas  as  tautologies.  Excising 
these  fallacies  of  relevance  is  no  straightforward  job,  for  once  they  go,  so  must 
other  tautologies,  such  as  these 

o    A  — >  (B  — >  A)  weakening 

o    B  — >  (~B  — >  A)     ex  contradictione  quodlibet 

from  which  they  can  be  derived.7  To  do  without  obvious  fallacies  of  relevance, 
we  must  do  without  these  formulas  too.  And  this  is  exactly  what  Orlov's  sys- 
tem manages  to  do.  His  system  contains  none  of  these  "fallacies  of  relevance", 
and  this  makes  his  system  a  relevant  logic.  In  Orlov's  system,  a  formula  A  — >  B 
is  provable  only  when  A  and  B  share  a  propositional  atom.  There  is  no  way  to 
prove  a  conditional  in  which  the  antecedent  and  the  consequent  have  noth- 
ing to  do  with  one  another.  Orlov  did  not  prove  this  result  in  his  paper.  It  only 
came  to  light  more  than  30  years  later,  with  more  recent  work  in  relevant  logic. 
This  more  recent  work  is  applicable  to  Orlov's  system,  because  Orlov  has  ax- 
iomatised  the  implication  and  negation  fragment  of  the  now  well-known  rel- 
evant logic  R. 

Orlov's  work  didn't  end  with  the  implication  and  negation  fragment  of  a 
relevant  propositional  logic.  He  looked  at  the  behaviour  of  other  connectives 
definable  in  terms  of  conjunction  and  negation.  In  particular,  he  showed  that 
defining  a  conjunction  connective 

 Ao  B  =df  ~(A  ->  ~B) 

6Heyting's  original  text  is  still  a  classic  introduction  to  intuitionistic  logic,  dating  from  this 
era  [134]. 

7Using  substitution  and  modus  ponens,  and  identity.  If  weakening  is  an  axiom  then  (A  — > 
A)  — *  (B  — *  (A  — *  A))  is  an  instance,  and  hence,  by  modus  ponens,  with  A  — *  A,  we  get 
B->(A->A). 
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gives  you  a  connective  you  can  prove  to  be  associative  and  symmetric  and 
square  increasing8 

(AoB)oC    ->  Ao(BoC) 
Ao(BoC)    ->  (AoB)oC 
Ao  B    ->    B  o  A 
A  — >    Ao  A 

However,  the  converse  of  the  "square  increasing"  postulate 

AoA-iA 

is  not  provable,  and  neither  are  the  stronger  versions  AoB->  AorBoA^  A. 
However,  for  all  of  that,  the  connective  Orlov  defined  is  quite  like  a  conjunc- 
tion, because  it  satisfies  the  following  condition: 

=#>  A  ->  (B  ->  C)  if  and  only  if  4  AoB^C 

You  can  prove  a  nested  conditional  if  and  only  if  you  can  prove  the  corre- 
sponding conditional  with  the  two  antecedents  combined  together  as  one. 
This  is  a  residuation  property.9  It  renders  the  connective  o  with  properties 
of  conjunction,  for  it  stands  with  the  implication  o  in  the  same  way  that  ex- 
tensional  conjunction  and  the  conditional  of  intuitionistic  or  classical  logic 
stand  together.10  Residuation  properties  such  as  these  will  feature  a  great  deal 
in  what  follows. 

It  follows  from  this  residuation  properly  that  o  cannot  have  all  of  the  prop- 
erties of  extensional  conjunction.  Ao  B  — >  A  is  not  provable  because  if  it  were, 
then  weakening  axiom  A  — >  (B  — >  A)  would  also  be  provable.  B  o  A  — >  A  is 
not  provable,  because  if  it  were,  B  — >  ( A  — >  A)  would  be. 

In  the  same  vein,  Orlov  defined  a  disjunction  connective 

A  +  B  =df  -A  ->  B 

which  can  be  proved  to  be  associative,  symmetric  and  square  decreasing  (A  + 
A  — >  A)  but  not  square  increasing.  It  follows  that  these  defined  connectives 
do  not  have  the  full  force  of  the  lattice  disjunction  and  conjunction  present 
in  classical  and  intuitionistic  logic.  At  the  very  first  example  of  the  study  of 
substructural  logics  we  are  that  the  doorstep  of  one  of  the  profound  insights 
made  clear  in  this  area:  the  splitting  of  notions  identified  in  stronger  logi- 
cal systems.  Had  Orlov  noticed  that  one  could  define  conjunction  explicitly 
following  the  lattice  definitions  (as  is  done  in  intuitionistic  logic,  where  the 
definitions  in  terms  of  negation  and  implication  also  fail)  then  he  would  have 
noticed  the  split  between  the  intensional  notions  of  conjunction  and  disjunc- 
tion, which  he  defined  so  clearly  and  the  extensional  notions  which  are  dis- 
tinct. We  will  see  this  distinction  in  more  detail  and  in  different  contexts  as 

8Here,  and  elsewhere,  brackets  are  minimised  by  use  of  binding  conventions.  The  general 
rules  are  simple:  conditional-like  connectives  such  as  — »  bind  less  tightly  than  other  two-place 
operators  such  as  conjunction  and  disjunction  (and  fusion  o  and  fission  +)  which  in  turn  bind 
less  tightly  than  one  place  operators.  So,  ~A  V  B  — >  C  A  D  is  the  conditional  whose  antecedent 
is  the  disjunction  of  ~ A  with  B  and  whose  consequent  is  the  conjunction  of  C  with  D . 

9It  ought  to  remind  you  of  simple  arithmetic  results:  x  =  z/y  if  and  only  if  x  x  y  =  z;  x  =  z  —  y 
if  and  only  if  x  +  y  =  z. 
1  "Namely,  that  A  A  B  D  C  is  provable  if  and  only  if  A  d(Bd  C). 
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we  continue  our  story  through  the  decades.  In  what  follows,  we  will  refer  to  o 
and  +  so  much  that  we  need  to  give  them  names.  I  will  follow  the  literature  of 
relevant  logic  and  call  them  fusion  and  fission. 

Good  ideas  have  a  feature  of  being  independently  discovered  and  redis- 
covered. The  logic  R  is  no  different.  Moh  [253]  and  Church  [56] ,  indepen- 
dently formulated  the  implication  fragment  of  R  in  the  early  1950's.  Moh  for- 
mulated an  axiom  system 


o    (A->  B)  ->  ((B  ->  C)  ->  (A  ->  C))  suffixing 

Whereas  Church's  replaces  the  assertion  and  suffixing  with  permutation  and 
prefixing 


Showing  that  these  two  axiomatisations  are  equivalent  is  an  enjoyable  (but 
lengthy)  exercise  in  axiom  chopping.  It  is  a  well-known  result  that  in  the 
presence  either  prefixing  and  suffixing,  permutation  is  equivalent  to  asser- 
tion. Similarly,  in  the  presence  of  either  permutation  or  assertion,  prefixing  is 
equivalent  to  suffixing.  (These  facts  will  be  more  perspicuous  when  we  show 
how  the  presence  of  these  axioms  correspond  to  particular  structural  rules. 
But  this  is  to  get  ahead  of  the  story  by  a  number  of  decades.) 

Note  that  each  of  the  axioms  in  either  Church's  or  Moh's  presentation  of 
R  are  tautologies  of  intuitionistic  logic.  Orlov's  logic  of  relevant  implication 
extends  intuitionistic  logic  when  it  comes  to  negation  (as  double  negation 
elimination  is  present)  but  when  it  comes  to  implication  alone,  the  logic  R  is 
weaker  than  intuitionistic  logic.  As  a  corollary,  Peirce's  law 


is  not  provable  in  R,  despite  being  a  classical  tautology.  The  fallacies  of  rele- 
vance are  examples  of  intuitionistic  tautologies  which  are  not  present  in  rel- 
evant logic.  Nothing  so  far  has  shown  us  that  adding  negation  conservatively 
extends  the  implication  fragment  of  R  (in  the  sense  that  there  is  no  impli- 
cational  formula  which  can  be  proved  with  negation  which  cannot  also  be 
proved  without  it).  However,  as  we  will  see  later,  this  is,  indeed  the  case. 
Adding  negation  does  not  lead  to  new  implicational  theorems. 

Church's  work  on  his  weak  implication  system  closely  paralleled  his  work 
on  the  lambda  calculus.  (As  we  will  see  later,  the  tautologies  of  this  system 
are  exactly  the  types  of  the  terms  in  his  AI  calculus.11)  Church's  work  extends 
that  of  Orlov  by  proving  a  deduction  theorem.  Church  showed  that  if  there  is 
a  proof  with  hypotheses  Ai  to  An  with  conclusion  B,  then  there  is  either  a 
proof  of  B  from  hypotheses  Ai  to  An_i  (in  which  case  An  was  irrelevant  as 
an  hypothesis)  or  there  is  a  proof  of  An  — >  B  from  A] , . . . ,  An_i . 

11  In  which  Ax  can  abstract  a  variable  from  only  those  terms  in  which  the  variable  x  occurs.  As  a 
result,  the  A-term  Ax.Ay  .x,  of  type  A  — >  ( B  — >  A) ,  is  a  term  of  the  traditional  A-calculus,  but  not 
of  the  AI  calculus. 


o    A  — >  A 


identity 

contraction 

assertion 


o  (A->  (A->  B))  ->  (A->  B) 
o  A^((A^B)^B) 


o  (A  ->  (B  ->  C))  ->  (B  ->  (A  ->  C))  permutation 
o    (A  ->  B)  ->  ((C  ->  A)  ->  (C  ->  B))  prefixing 


o    ((A^B)^A)^A     Peirce's  law 
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Fact  1  (Church's  Deduction  Theorem)  In  the  implicational  fragment  of 
the  relevant  logic  R,  if  A] , . . . ,  An  =#>  B  can  be  proved  in  the  Hilbert  system  then 
either  of  the  following  two  consequences  can  also  be  proved  in  that  system. 

o  Ai,...,An_i  =^B, 

o  Ai,...,An_i      An  ->  B. 

Proof  The  proof  follows  the  traditional  proof  of  the  Deduction  Theorem  for 
the  implicational  fragment  of  either  classical  or  intuitionistic  logic.  A  proof 
for  Ai , . . . ,  An  B  is  transformed  into  a  proof  for  Ai , . . . ,  An_i  An  — >  B 
by  prefixing  each  step  of  the  proof  by  "An  — >".  The  weakening  axiom  A  — > 
(B  — >  A)  is  needed  in  the  traditional  result  for  the  step  showing  that  if  an  hy- 
pothesis is  not  used  in  the  proof,  it  can  be  introduced  as  an  antecedent  any- 
way Weakening  is  not  present  in  R,  and  this  step  is  not  needed  in  the  proof  of 
Church's  result,  because  he  allows  a  special  clause,  exempting  us  from  prov- 
ing An  — >  B  when  An  is  not  actually  used  in  the  proof.  □ 

We  will  see  others  later  on  in  our  story.  This  deduction  theorem  lays  some 
claim  to  helping  explain  the  way  in  which  the  logic  R  can  be  said  to  be  rele- 
vant. The  conditional  of  R  respects  use  in  proof.  To  say  that  A  — >  B  is  true  is  to 
say  not  only  that  B  is  true  whenever  A  is  true  (keeping  open  the  option  that  A 
might  have  nothing  to  do  with  B) .  To  say  that  A  — >  B  is  true  is  to  say  that  B  fol- 
lows from  A.  This  is  not  the  only  kind  of  deduction  theorem  applicable  to  rel- 
evant logics.  In  fact,  it  is  probably  not  the  most  satisfactory  one,  as  it  fails  once 
the  logic  is  extended  to  include  extensional  conjunction.  After  all,  we  would 
like  A,B=>AAB  but  we  can  have  neither  A^B^AAB  (since  that  would 
give  the  fallacy  of  relevance  A  =#>  B  — >  A,  in  the  presence  of  A  A  B  — >  A)  nor 
A  =#>  A  A  B  (which  is  classically  invalid,  and  so,  relevantly  invalid) .  So,  another 
characterisation  of  relevance  must  be  found  in  the  presence  of  conjunction. 
In  just  the  same  way,  combining  conjunction-like  pairing  operations  in  the  AI 
calculus  has  proved  quite  difficult  [212].  Avron  has  argued  that  this  difficulty 
should  make  us  conclude  that  relevance  and  extensional  connectives  cannot 
live  together  [13,  14]. 

Meredith  and  Prior  were  also  aware  of  the  possibility  of  looking  for  log- 
ics weaker  than  classical  propositional  logic,  and  that  different  axioms  corre- 
sponded to  different  principles  of  the  A-calculus  (or  in  Meredith  and  Prior's 
case,  combinatory  logic).  Following  on  from  work  of  Curry  and  Feys  [62,  63], 
they  formalised  subsystems  of  classical  logic  including  what  they  called  BCK 
(logic  without  contraction)  and  BCI  (logic  without  contraction  or  weakening: 
which  is  know  known  as  linear  logic)  [  1 69] .  They,  with  Curry,  are  the  first  to  ex- 
plicitly chart  the  correspondence  of  propositional  axioms  with  the  behaviour 
of  combinators  which  allow  the  rearrangement  of  premises  or  antecedents.12 

For  a  number  of  years  following  this  pioneering  work,  the  work  of  Ander- 
son and  Belnap  continued  in  this  vein,  using  techniques  from  other  branches 
of  proof  theory  to  explain  how  the  logic  R  and  its  cousins  respected  condi- 
tions of  relevance  and  necessity.  We  will  shift  our  attention  now  to  another  of 
the  precursors  of  Anderson  and  Belnap's  work,  one  which  pays  attention  to 
conditions  of  necessity  as  well  as  relevance. 

12  It  is  in  their  honour  that  I  use  Curry's  original  terminology  for  the  structural  rules  we  will  see 
later:  W  for  contraction,  K  for  weakening,  C  for  commutativity,  etc. 
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2.2   Entailment:  Ackermann 

Ackermann  formulated  a  logic  of  entailment  in  the  late  1950s  [2].  He  ex- 
tended C.  I.  Lewis'  work  on  systems  of  entailment  to  respect  relevance  and 
to  avoid  the  paradoxes  of  strict  implication.  His  favoured  system  of  entail- 
ment is  a  weakening  of  the  system  S4  of  strict  implication  designed  to  avoid 
the  paradoxes.  Unlike  earlier  work  on  relevant  implication,  Ackermann's  sys- 
tem includes  the  full  complement  of  sentential  connectives. 

To  motivate  the  departures  that  Ackermann's  system  takes  from  R,  note 
that  the  arrow  of  R  is  no  good  at  all  to  model  entailment.  If  we  want  to  say 
that  A  entails  that  B,  the  arrow  of  R  is  significantly  too  strong.  Specifically, 
axioms  such  as  permutation  and  assertion  must  be  rejected  for  the  arrow  of 
entailment.  To  take  an  example,  suppose  that  A  is  contingently  true.  It  is  an 
instance  of  assertion  that 

A^  ((A->  A)  ->  A) 

However,  even  if  A  is  true,  it  ought  not  be  true  that  A  — >  A  entails  A.  For 
A  — >  A  is  presumably  necessarily  true.  We  cannot  not  have  this  necessity 
transferring  to  the  contingent  claim  A.13  Permutation  must  go  too,  as  asser- 
tion is  follows  from  permuting  the  identity  (A  — >  B)  — >  (A  — >  B).  So,  a  logic  of 
entailment  must  be  weaker  than  R.  However,  it  need  not  be  too  much  weaker. 
It  is  clear  that  prefixing,  suffixing  and  contraction  are  not  prone  to  any  sort  of 
counterexample  along  these  lines:  they  can  survive  into  a  logic  of  entailment. 

Ackermann's  original  paper  features  two  different  presentations  of  the  sys- 
tem of  entailment.  The  first,  I',  is  an  ingenious  consecution  calculus,  which 
is  unlike  any  proof  theory  which  has  survived  into  common  use,  so  unfor- 
tunately, I  must  skim  over  it  here  in  one  paragraph.14  The  system  manipu- 
lates consecutions  of  the  form  A,  B  h  C  (to  be  understood  as  A  A  B  — >  C) 
and  A*,  B  h  C  (to  be  understood  as  as  A  — >  (B  — >  C)).  If  you  note  that  the 
comma  in  the  antecedent  place  has  no  uniform  interpretation,  and  that  what 
you  have,  in  effect,  is  two  different  premise  combining  operations.  This  is, 
in  embryonic  form  at  least,  the  first  explicit  case  of  a  dual  treatment  of  both 
intensional  and  extensional  conjunction  in  a  proof  theory  that  I  have  found. 

Ackermann's  other  presentation  of  the  logic  of  entailment  is  a  Hilbert  sys- 
tem. The  axioms  and  rules  are  presented  in  Figure  1.  You  can  see  that  many  of 
the  axioms  considered  have  already  occurred  in  the  study  of  relevant  implica- 
tion. The  innovations  appear  in  both  what  is  omitted  (assertion  and  permu- 
tation, as  we  have  seen)  and  in  the  full  complement  of  rules  for  conjunction 
and  disjunction.15 

To  make  up  for  the  absence  of  assertion  and  permutation,  Ackermann 
adds  restricted  permutation.  This  rule  is  not  a  permutation  rule  (it  doesn't 
permute  anything)  but  it  is  a  restriction  of  the  permutation  rule  to  infer  B  — > 
(A  — >  C)  from  A  — >  (B  — >  C).  For  the  restricted  rule  we  conclude  A  — >  C  from 

13If  something  is  entailed  by  a  necessity,  it  too  is  necessary.  If  A  entails  B  then  if  we  cannot 
have  A  false,  we  cannot  have  B  false  either. 

14The  interested  reader  is  referred  to  Ackermann's  paper  (in  German)  [2]  or  to  Anderson,  Bel- 
nap  and  Dunn's  sympathetic  summary  [11,  §44-46]  (in  English). 

15The  choice  of  counterexample  as  a  thesis  connecting  implication  and  negation  in  place  of 
reductio  (as  in  Orlov)  is  of  no  matter.  The  two  are  equivalent  in  the  presence  of  contraposition 
and  double  negation  rules.  Showing  this  is  a  gentle  exercise  in  axiom-chopping. 
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Axioms 


o 

A 

identity 

o 

(A->  B)  ->  ((C  ->  B)  ->  (C  ->  A)) 

prefixing 

o 

(A-»B)-»((B 

-»C)- 

MA-tC)) 

suffixing 

o 

(A-»  (A-»B)) 

— >  (A  - 

■*B) 

contraction 

o 

AAB— >A,  AAB^B 

conjunction  elimination 

o 

(A^B)A(A- 

■*  C)  -> 

(A^BAC) 

conjunction  introduction 

o 

A  — >  A  V  B,  B  - 

->  A  V  B 

disjunction  introduction 

o 

(A->  C)  A(B  - 

^  C)  -» 

(A  V  B  — >  C) 

disjunction  elimination 

o 

AA(BVC)  -> 

BV(AAC) 

distribution 

o 

(A->  B)  ->  (~B 

^~A) 

contraposition 

<> 

AA^B  ->~(A- 

-»B) 

counterexample 

o 

A^  —A 

double  negation  introduction 

o 

~~A  — >  A 

Rules 

double  negation  elimination 

(a) 

A,  A  — >  B  ^  B 

modus  ponens 

(P) 

A,B4AAB 

adjunction 

(Y) 

A^AVB^B 

disjunctive  syllogism 

(6) 

A^  (B  ->  C),B  =4>  A- 

^  C 

restricted  permutation  rule 

Figure  1:  Ackermann's  axiomatisation  TV 

A  — >  (B  — >  C)  and  B.  Clearly  this  follows  from  permutation.  This  restriction 
allows  a  restricted  form  of  assertion  too. 

o    (A  ->  A')  ->  (((A  ->  A')  -»  B)  ->  B)     restricted  assertion 

This  is  an  instance  of  the  assertion  where  the  first  position  A  is  replaced  by 
the  entailment  A  — >  A'.  While  assertion  might  not  be  valid  for  the  logic  of 
entailment,  it  is  valid  when  the  proposition  in  the  first  position  is  itself  an 
entailment. 

As  Anderson  and  Belnap  point  out  [11,  §8.2],  (6)  is  not  a  particularly  satis- 
factory rule.  Its  status  is  akin  to  that  of  the  rule  of  necessitation  in  modal  logic 
(from  A  to  infer  =#>  □  A) .  It  does  not  extend  to  an  entailment  (A  — >  □  A) .  If  it 
is  possible  to  do  without  a  rule  like  this,  it  seems  preferable,  as  it  licences  tran- 
sitions in  proofs  which  do  not  correspond  to  valid  entailments.  Anderson  and 
Belnap  showed  that  you  can  indeed  do  without  (5)  to  no  ill  effect.  The  system 
is  unchanged  when  you  replace  restricted  permutation  by  restricted  assertion. 

This  is  not  the  only  rule  of  Ackermann's  entailment  which  provokes  com- 
ment. The  rule  {y)  (called  disjunctive  syllogism)  has  had  more  than  its  fair 
share  of  ink  spilled.  It  suffers  the  same  failing  in  this  system  of  entailment 
as  does  (6) :  it  does  not  correspond  to  a  valid  entailment.  The  corresponding 
entailment  A  A  (~A  V  B )  — >  B  is  not  provable.  I  will  defer  its  discussion  to  Sec- 
tion 2.4,  by  which  time  we  will  have  sufficient  technology  available  to  prove 
theorems  about  disjunctive  syllogism  as  well  as  arguing  about  its  significance. 

Ackermann's  remaining  innovations  with  this  system  are  at  least  twofold. 
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First,  we  have  an  thorough  treatment  of  extensional  disjunction  and  con- 
junction. Ackermann  noticed  that  you  need  to  add  distribution  of  conjunc- 
tion over  disjunction  as  a  separate  axiom.16  The  conjunction  and  disjunction 
elimination  and  introduction  rules  are  sufficient  to  show  that  conjunction 
and  disjunction  are  lattice  join  and  meet  on  propositions  ordered  by  provable 
entailment.  (It  is  a  useful  exercise  to  show  that  in  this  system  of  entailment, 
you  can  prove  A  V  ~A,  -(A  A  ~A),  and  that  all  de  Morgan  laws  connecting 
negation,  conjunction  and  disjunction  hold.) 

The  final  innovation  is  the  treatment  of  modality.  Ackermann  notes  that 
as  in  other  systems  of  modal  logic  which  take  entailment  as  primary  it  is  pos- 
sible to  define  the  one-place  modal  operators  of  necessity,  possibility  and 
others  in  terms  of  entailment.  A  traditional  choice  is  to  take  impossibility 
"U"17  defined  by  setting  UA  to  be  A  — >  B  A  ~B  for  some  choice  of  a  contradic- 
tion. Clearly  this  will  not  do  in  the  case  of  a  relevant  logic  as  even  though  it 
makes  sense  to  say  that  if  A  entails  the  contradictory  BA~B  then  A  is  impossi- 
ble, we  might  have  A  entailing  some  contradiction  (and  so,  being  impossible) 
without  entailing  that  contradiction.  It  is  a  fallacy  of  relevance  to  take  all  con- 
tradictions to  be  provably  equivalent.  No,  Ackermann  takes  another  tack,  by 
introducing  a  new  constant  f,  with  some  special  properties.18  The  intent  is 
to  take  f  to  mean  "some  contradiction  is  true".  Ackermann  then  posits  the 
following  axioms  and  rules. 

o       A  A  -A  ->  f 
o       (A  ->  f )  ->  ^A 

(e)  A^B,(A^B)AC^f4C^f 

Clearly  the  first  two  are  true,  if  we  interpret  f  as  the  disjunction  of  all  contra- 
dictions. The  last  we  will  not  tarry  with.  It  is  an  idiosyncratic  rule,  distinctive 
to  Ackermann.  More  important  for  our  concern  is  the  definition  of  f .  It  is 
a  new  constant,  with  new  properties  which  open  up  once  we  enter  the  sub- 
structural  context.  Classically  (or  intuitionistically)  f  would  behave  as  _L,  a 
proposition  which  entails  all  others.  In  a  substructural  logic  like  R  or  Acker- 
mann's  entailment,  f  does  no  such  thing.  It  is  true  that  f  is  provably  false  (we 
can  prove  ~f,  from  the  axiom  (f  — >  f )  — >  ~f)  but  it  does  not  follow  that  f  en- 
tails everything.  Again,  a  classical  notion  splits:  there  are  two  different  kinds 
of  falsehood.  There  is  the  Ackermann  false  constant  f ,  which  is  the  weakest 
provably  false  proposition,  and  there  is  the  Church  false  constant  _L,  which  is 
the  strongest  false  proposition,  which  entails  every  proposition  whatsoever. 
Classically  and  intuitionistically  both  are  equivalent.  Here,  they  come  apart. 

The  two  false  constants  are  mirrored  by  their  negations:  two  true  con- 
stants. The  Ackermann  true  constant  t  (which  is  ~f)  is  the  conjunction  of  all 
tautologies.  The  Church  true  constant  T  (which  is  ~_L)  is  the  weakest  propo- 
sition of  all,  such  that  A  — >  T  is  true  for  each  A.  If  we  are  to  define  necessity 
by  means  of  a  propositional  constant,  then  t  — >  A  is  the  appropriate  choice. 
For  t  — >  A  will  hold  for  all  provable  A.  Choosing  T  — >  A  would  be  much  too 

16If  we  have  the  residuation  of  conjunction  by  3  (intuitionistic  or  classical  material  implica- 
tion) then  distribution  follows.  The  algebraic  analogue  of  this  result  is  the  thesis  that  a  residuated 
lattice  is  distributive. 

17For  unmoglich. 

18Actually,  Ackermann  uses  the  symbol  "A",  but  it  now  appears  in  the  literature  as  "f ". 
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restrictive,  as  we  would  only  allow  as  "necessary"  propositions  which  were 
entailed  by  all  others.  Since  we  do  not  have  AV-A^  B  V  ~B,  if  we  want 
both  to  be  necessary,  we  must  be  happy  with  the  weaker  condition,  of  being 
entailed  by  t. 

This  choice  of  true  constant  to  define  necessity  motivates  the  choice  that 
Anderson  and  Belnap  used,  t  must  entail  each  proposition  of  the  form  A  — >  A 
(as  each  is  a  tautology).  Anderson  and  Belnap  showed  that  t  — >  A  in  Acker- 
mann's  system  is  equivalent  to  (A  — >  A)  — >  A,  and  so  they  use  (A  — >  A)  — >  A 
as  a  definition  of  DA,  and  in  this  way,  they  showed  that  it  was  possible  to 
define  the  one-place  modal  operators  in  the  original  language  alone,  with- 
out the  use  of  propositional  constants  at  all.19  It  is  instructive  to  work  out 
the  details  of  the  behaviour  of  □  as  we  have  defined  it.  Necessity  here  has 
properties  roughly  of  S4.  In  particular,  you  can  prove  DA  — >  EDA  but  not 
OA  — >  DOA  in  Ackermann's  system.20  (You  will  note  that  using  this  defini- 
tion of  necessity  and  without  (6)  you  need  to  add  an  axiom  to  the  effect  that 
□A  A  DB  — >  E(A  A  B),21  as  it  cannot  be  proved  from  the  system  as  it  stands. 
Defining  DA  as  t  — >  A  does  not  have  this  problem.) 

2.3   Anderson  and  Belnap 

We  have  well-and-truly  reached  beyond  Ackermann's  work  on  entailment  to 
that  of  Alan  Anderson  and  Nuel  Belnap.  Anderson  and  Belnap  started  their 
exploration  of  relevance  and  entailment  with  Ackermann's  work  [6,  8] ,  but 
very  soon  it  became  an  independent  enterprise  with  a  wealth  of  innovations 
and  techniques  from  their  own  hands,  and  from  their  students  and  colleagues 
(chiefly  J.  Michael  Dunn,  Robert  K.  Meyer,  Alasdair  Urquhart,  Richard  Rout- 
ley  (later  known  as  Richard  Sylvan)  and  Kit  Fine).  Much  of  this  research  is  re- 
ported in  the  two-volume  Entailment  [10, 1 1] ,  and  in  the  papers  cited  therein. 
There  is  no  way  that  I  can  adequately  summarise  this  work  in  a  few  pages. 
However,  I  can  sketch  what  I  take  to  be  some  of  the  most  important  enduring 
themes  of  this  tradition. 

2. 3. 1    Fitch  Systems 

Hilbert  systems  are  not  the  most  only  way  to  present  proofs.  Other  proof  the- 
ories give  us  us  different  insights  into  a  logical  system  by  isolating  rules  rele- 
vant to  each  different  connective.  Hilbert  systems,  with  many  axioms  and  few 
rules,  are  not  so  suited  to  a  project  of  understanding  the  internal  structure  of  a 
family  of  logical  systems.  It  is  no  surprise  that  in  the  relevant  logic  tradition, 
a  great  deal  of  work  was  invested  toward  providing  different  proof  theories 
which  model  directly  the  relationship  between  premises  and  conclusions. 

The  first  natural  deduction  system  for  R  and  E  (Anderson  and  Belnap's  sys- 
tem of  entailment)  was  inspired  by  Fitch's  natural  deduction  system,  in  com- 
mon use  in  undergraduate  and  postgraduate  logic  instruction  in  the  1950s  in 
the  United  States  [100]. 22  A  Fitch  system  is  a  linear  presentation  of  a  natural 

19 Impossibility  U A  is  then  A  — >  ~(A  — >  A). 
20Defining  OA  as  ~U~A,  i.  e.,  as~((A  ->  A)  ->  ~A). 

21The  axiom  is  ungainly  when  it  is  written  out  in  full:  ((A  — >  A)  — >  A)  A  ((B  — >  B)  — >  B)  — > 
((AAB^AAB)^AAB). 

22That  Fitch  systems  would  be  used  by  Anderson  and  Belnap  is  to  be  expected.  It  is  also  to 
be  expected  that  Read  [224]  and  Slaney  [256]  (from  the  U.  K.)  use  Lemmon-style  natural  deduc- 
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deduction  proof,  with  introduction  and  elimination  rules  for  each  connec- 
tive, and  the  use  of  vertical  lines  to  present  subproofs  —  parts  of  proofs  un- 
der hypotheses.  Here,  for  example,  is  a  proof  of  the  relevantly  unacceptable 
weakening  axiom  in  a  Fitch  system  for  classical  (or  intuitionistic)  logic: 


1 

2 
3 
4 
5 


A 


B 
A 


B  ->  A 
A^  (B  ->  A) 


hyp 
hyp 
1  reit 
2-3  ->I 
1-4 ->I 


Each  line  is  numbered  to  the  left,  and  the  annotation  to  the  right  indicates 
the  provenance  of  each  formula.  A  line  marked  with  "hyp"  is  an  hypothesis, 
and  its  introduction  increases  the  level  of  nesting  of  the  proof.  In  line  4  we 
have  the  application  of  conditional  proof,  or  as  it  is  indicated  here,  implica- 
tion introduction  (— >I).  Since  A  has  been  proved  under  the  hypothesis  of  B, 
we  deduce  B  — >  A,  discharging  that  hypothesis.  The  other  distinctive  feature 
of  Fitch  proofs  is  the  necessity  to  reiterate  formulas.  If  a  formula  appears  out- 
side a  nested  subproof,  it  is  possible  to  reiterate  it  under  the  assumption,  for 
use  inside  the  subproof. 

Now,  this  proof  is  defective,  if  we  take  — >  to  indicate  relevant  implication. 
There  are  two  possible  points  of  disagreement.  One  is  to  question  the  proof 
at  the  point  of  line  3:  perhaps  something  has  gone  wrong  at  the  point  of  re- 
iterating A  in  the  subproof.  This  is  not  where  Anderson  and  Belnap  modify 
Fitch's  system  in  order  to  model  R.23  As  you  can  see  in  the  proof  of  (relevantly 
acceptable)  assertion  axiom,  reiteration  of  a  formula  from  outside  a  subproof 
is  unproblematic. 


1 

2 
3 
4 
5 
6 


A 

A^  B 
A 
B 

(A->  B)  ->  B 
->  ((A->B)  - 


hyp 
hyp 
1  reit 

2-3  ->E 
2-4 ->I 
1-5 ->I 


The  difference  between  the  two  proofs  indicates  what  has  gone  wrong  in  the 
proof  of  the  weakening  axiom.  In  this  proof,  we  have  indeed  used  A  — >  B 
in  the  proof  of  B  from  lines  1  to  4.  In  the  earlier  "proof",  we  indeed  proved 
A  under  the  assumption  of  B  but  we  did  not  use  B  in  that  proof.  The  impli- 
cation introduction  in  line  4  is  fallacious.  If  I  am  to  pay  attention  to  use  in 
proof,  I  must  keep  track  of  it  in  some  way.  Anderson  and  Belnap's  innovation 
is  to  add  labels  to  formulas  in  proofs.  The  label  is  a  set  of  indices,  indicating 
the  hypotheses  upon  which  the  formula  depends.  If  I  introduce  an  hypoth- 
esis A  in  a  proof,  I  add  a  new  label,  a  singleton  of  a  new  index  standing  for 

tion  [153],  modelled  after  Lemmon's  textbook,  used  in  the  U.  K.  for  many  years.  Logicians  on 
continental  Europe  are  much  more  likely  to  use  Prawitz  [214]  or  Gentzen-style  [111,  112]  natu- 
ral deduction  systems.  This  geographic  distribution  of  pedagogical  techniques  (and  its  result- 
ing influence  on  the  way  research  is  directed,  as  well  as  teaching)  is  remarkably  resilient  across 
the  decades.  The  recent  publication  of  Barwise  and  Etchemendy's  popular  textbook  introducing 
logic  still  uses  a  Fitch  system  [19].  As  far  as  I  am  aware,  instruction  in  logic  in  none  of  the  major 
centres  in  Europe  or  Australia  centres  on  Fitch-style  presentation  of  natural  deduction. 

23  Restricting  reiteration  is  the  way  to  give  hypothesis  generation  and  conditional  introduction 
modal  force,  as  we  shall  see  soon. 
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that  hypothesis.  The  implication  introduction  and  elimination  rules  must  be 
amended  to  take  account  of  labels.  For  implication  elimination,  given  A  Q  and 
A  — >  Bb,  I  conclude  BaUb,  for  this  instance  of  B  in  the  proof  depends  upon  ev- 
erything we  needed  for  A  and  for  A  — >  B.  For  implication  elimination,  given 
a  proof  of  BQ  under  the  hypothesis  A{i},  I  can  conclude  A  — >  BQ_W,  provided 
that  i  e  a.  With  these  amended  rules,  we  can  annotate  the  original  proof  of 
assertion  with  labels,  as  follows. 


1 

2 
3 
4 
5 
6 


A  -»  B{2} 
A(i} 

B{1,2} 

(A-»B)-»B{1} 
A^  ((A->  B)  ->  B) 


hyp 
hyp 
1  reit 

2-3  — >E 
2-4->I 
1-5  ->I 


The  proof  of  weakening,  on  the  other  hand,  cannot  be  annotated  with  labels 
satisfying  the  rules  for  implication. 


1 

/ 

41} 

hyp 

2 

B{2} 

hyp 

3 

A(i} 

1  reit 

4 

E 

— >  A??? 

2-3  ->I 

5 

A- 

->  (B  — >  A)??? 

1^  ->I 

Modifying  the  system  to  model  entailment  is  straightforward.  As  I  hinted  ear- 
lier, if  the  arrow  has  a  modal  force,  we  do  not  want  unrestricted  reiteration. 
Instead  of  allowing  an  arbitrary  formula  to  be  reiterated  into  a  subproof,  since 
entertaining  an  hypothesis  now  has  the  force  of  considering  an  alternate  pos- 
sibility, we  must  only  allow  for  reiteration  formulas  which  might  indeed  hold 
in  those  alternate  possibilities.  Here,  the  requisite  formulas  are  entailments. 
Entailments  are  not  only  true,  but  true  of  necessity,  and  so,  we  can  reiterate 
an  entailment  under  the  context  of  an  hypothesis,  but  we  cannot  reiterate 
atomic  formulas.  So,  the  proof  above  of  assertion  breaks  down  at  the  point 
at  which  we  wished  to  reiterate  A  into  the  second  subproof.  The  proof  of  re- 
stricted assertion  will  succeed. 


1 

2 
3 
4 
5 
6 


A  — >  A'{i} 
"  (A-»A') 
A->A{'1} 

B{1,2} 

((A->A')- 
(A->  A') 


Hi) 


B)->B< 


1! 


(((A->  A')  ->  B)  ->  B) 


hyp 
hyp 
1  reit 
2-3  ^E 
2-4 ->I 
1-5  ^1 


This  is  a  permissible  proof  because  we  are  entitled  to  reiterate  A  — >  A '  at  line 
3.  Even  given  the  assumption  that  (A  — >  A')  — >  B,  the  prior  assumption  of 
A  — >  A'  holds  in  the  new  context. 

Here  is  a  slightly  more  complex  proof  in  this  Fitch  system  for  entailment. 
(Recall  that  DA  is  shorthand  for  (A  — >  A)  — >  A,  for  Anderson  and  Belnap's 
system  of  entailment.)  This  proof  shows  that  in  E,  the  truth  of  an  entailment 
(here  B  — >  C)  entails  that  anything  entailed  by  that  entailment  (here  A)  is 
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itself  necessary  too.  The  reiterations  on  lines  4  and  5  are  permissible,  because 
B  — >  C  and  (B  — >  C)  — >  A  are  both  entailments. 


We  say  that  A  follows  relevantly  from  B  when  a  proof  of  with  hypothesis  A{i} 
concludes  in  B{i}.  This  is  written  "A  h  B."  We  say  that  A  is  provable  by  itself 
when  there  is  a  proof  of  A  with  no  label  at  all.  Then  the  Hilbert  system  and 
the  natural  deduction  system  match  in  the  following  two  senses. 

Fact  2  (Hilbert  and  Fitch  Equivalence)  A  — >  B  if  and  only  if  A  h  B. 
=#>  A  if  and  only  ifV-  A. 

Proof  The  proof  is  by  an  induction  on  the  complexity  of  proofs  in  both 
directions.  To  convert  a  Fitch  proof  to  a  Hilbert  proof,  we  replace  the  hy- 
potheses A{i}  by  the  identity  A  — >  A,  and  the  arbitrary  formula  B{il  ,i2,...,in} 
by  Ai  — >  (A2  —>•••—>  (An  — >  B)  •  •  •)  (where  Aj  is  the  formula  introduced 
with  label  Aj ) .  Then  you  show  that  the  the  steps  between  these  formulas  can 
be  justified  in  the  Hilbert  system.  Conversely,  you  simply  need  to  show  that 
each  Hilbert  axiom  is  provable  in  the  Fitch  system,  and  that  modus  ponens 
preserves  provability.  Neither  proof  is  difficult.  □ 
Other  restrictions  on  reiteration  can  be  given  to  this  Fitch  system  in  order  to 
model  even  weaker  logics.  In  particular,  Anderson  and  Belnap  examine  a  sys- 
tem T  of  ticket  entailment,  whose  rationale  is  the  idea  that  statements  of  the 
form  A  — >  B  are  rules  but  not  facts.  They  are  to  be  used  as  major  premises  of 
implication  eliminations,  but  not  as  minor  premises.  The  restriction  on  reit- 
eration to  get  this  effect  allows  you  to  conclude  BQUb  from  AQ  and  A  — >  Bb) 
provided  that  n\ax[\>)  <  max(a).  The  effect  of  this  is  to  render  restricted  as- 
sertion unprovable,  while  identity,  prefixing,  suffixing  and  contraction  remain 
provable  (and  these  axiomatise  the  calculus  T  of  ticket  entailment).24  (It  is  an 
open  problem  to  this  day  whether  the  implicational  fragment  of  T  is  decid- 
able.) 

Before  considering  the  extension  of  this  proof  theory  to  deal  with  the  ex- 
tensional  connectives,  let  me  note  one  curious  result  in  the  vicinity  of  T.  The 
logic  TW  you  get  by  removing  contraction  from  T  has  a  surprising  property. 
Errol  Martin  has  shown  that  if  A  — >  B  and  B  — >  A  are  provable  in  TW,  then  A 
and  B  must  be  the  same  formula  [166]. 25 

24This  is  as  good  a  place  as  any  to  note  that  the  axiom  of  self  distribution  (A  -)  (B  -)  C))  -) 
((A-)B)  — >  (A  — >  C))  will  do  instead  of  contraction  in  any  of  these  axiomatisations. 

25  Martin's  proof  proceeds  via  a  result  showing  that  the  logic  given  by  prefixing  and  suffixing 
(without  identity)  has  no  instances  of  identity  provable  at  all.  This  is  required,  for  (A  — >  B)  — > 
((B  — >  A)  — >  (A  — >  A))  is  an  instance  of  suffixing.  The  system  S  (for  syllogism)  has  interesting 
properties  in  its  own  right,  modelling  noncircular  (non  "question  begging"?)  logic  [165]. 


2 
3 
4 
5 
6 
7 
8 
9 

10 


Ad  ,2}  4,5  ^E 

A{i,2,3}  3,6^E 

(A^A)^A{li2}  3-7 ->I 

((B  — >  C)  — >  A)  — »'dA{i}  2-8  — >I 

B  ->  C)  ->  (((B  ->  C)  ->  A)  ->  DA)  1-9 ->I 
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2.3.2   First  Degree  Entailment 

It  is  one  thing  to  provide  a  proof  theory  for  implication  or  entailment.  It  is 
another  to  combine  it  with  a  theory  of  the  other  propositional  connectives: 
conjunction,  disjunction  and  negation.  Anderson  and  Belnap's  strategy  was 
to  first  decide  the  behaviour  of  conjunction,  disjunction  and  negation,  and 
then  combine  this  theory  with  the  theory  of  entailment  or  implication.  This 
is  gives  the  structure  of  the  first  volume  of  Entailment  [10] .  The  first  100  pages 
deals  with  implication  alone,  the  next  50  with  implication  and  negation,  the 
next  80  with  the  first  degree  fragment  (entailments  between  formulas  not  in- 
cluding implication)  and  only  at  page  231  do  we  find  the  formulation  of  the 
full  system  E  of  entailment. 

Anderson  and  Belnap's  work  on  entailments  between  truth  functions  (or 
what  they  call  first  degree  entailment)  dates  back  to  a  paper  in  1962  [9] .  There 
are  many  different  ways  to  carve  out  first  degree  entailments  which  are  rel- 
evant from  those  which  are  not.  For  example,  filter  techniques  due  to  von 
Wright  [288],  Lewy  [155],  Geach  [109]  and  Smiley  [257]  tell  us  that  statements 
like 

A-iBV-B  AA^A^B 

fail  as  entailments  because  there  is  no  atom  shared  between  antecedent  and 
consequent.  So  far,  so  good,  and  their  account  follows  Anderson  and  Bel- 
nap's. However,  if  this  is  the  only  criterion  to  add  to  classical  entailment,  we 
allow  through  as  entailments  their  analogues: 

A  — >  A  A  (B  V  ~B)       (A  A  ~A)  VB^B 

for  the  propositional  atom  A  is  shared  in  the  first  case,  and  B  in  the  second. 
Since  both  of  the  following  classical  entailments 

A  A  (B  V  ~B)  — >  B  V  ~B       A  A  ^A  ->  (A  A  ~A)  V  B 

also  satisfy  the  atom-sharing  requirement,  using  variable  sharing  as  the  only 
criterion  makes  us  reject  the  transitivity  of  entailment.  After  all,  given  A  — > 
A  A  (B  V  ~B)  and  given  AA(BV-B)  ->  B  V  ~B,  if  ->  is  transitive,  we  get 
A-i  BV ~B.26 

Anderson  and  Belnap  respond  by  noting  that  if  A  — >  B  V  ~B  is  problematic 
because  of  relevance,  then  A  — >  A  A  (B  V  ~B)  is  at  least  50%  problematic  [10, 
page  155] .  Putting  things  another  way,  if  to  say  that  A  entails  B  A  C  is  at  least 
to  say  that  A  entails  B  and  that  A  entails  C,  then  we  cannot  just  add  a  blanket 
atom-sharing  criterion  to  filter  out  failures  of  relevance,  for  it  might  apply  to 
one  conjunct  and  not  the  other.  Filter  techniques  do  not  work. 

Anderson  and  Belnap  characterise  valid  first  degree  entailments  in  a  num- 
ber of  ways.  The  simplest  way  which  does  not  use  any  model  theory  is  a  nor- 
mal form  theorem  for  first  degree  entailments.  We  will  use  a  process  of  re- 
duction to  transform  arbitrary  entailments  into  primitive  entailments,  which 

26Nontransitive  accounts  of  entailment  have  survived  to  this  day,  with  more  sophistication. 
Neil  Tennant  has  an  idiosyncratic  approach  to  normalisation  in  logics,  arguing  for  a  "relevant 
logic"  which  differs  from  our  substructural  logics  by  allowing  the  validity  of  A  V  B,  ~A  h  B  and 
A  h  A  V  B,  while  rejecting  A,~A  h  B  [261,  262].  Tennant's  system  rejects  the  unrestricted 
transitivity  of  proofs:  the  'Cut'  which  would  allow  A,  ~A  h  B  from  the  proofs  of  A  V  B ,  ~A  h  B 
and  A  h  A  V  B  is  not  admissible.  Tennant  uses  normalisation  to  motivate  this  system. 


Greg  Restall,  Greg . Restaliamq .  edu .  au 


June  23,  2001 


http: //www. phil .mq.edu. au/staf f /grestall/ 


16 


we  can  determine  on  sight.  The  first  part  of  the  process  is  to  drive  negations 
inside  other  operators,  leaving  them  only  on  atoms.  We  use  the  de  Morgan 
equivalences  and  the  double  negation  equivalence  to  do  this.27 

~(A  VB)  h  ~A  A  ~B       ^(AAB)h^AV-B  -AhA 

(I  write  "A  <->  B"  here  a  shorthand  for  "both  A  ->  B  and  B  ->  A".) 

The  next  process  involves  pushing  conjunctions  and  disjunctions  around. 
The  aim  is  to  make  the  antecedent  of  our  putative  entailment  a  disjunction 
of  conjunctions,  and  the  consequent  a  conjunction  of  disjunctions.  We  use 
these  distribution  facts  to  this  effect.28 

(AVB)ACh  (A  AC)  V(BAC)  (AAB)VCh(AVC)A(BVC) 

With  that  transformation  done,  we  break  the  entailment  up  into  primitive  en- 
tailments in  these  two  kinds  of  steps: 

AVB^C  if  and  only  if  A  ->  C  and  B  ->  C 

A  ->  B  AC  if  and  only  if  A  ->  B  and  A  ->  C 

Each  of  these  transformation  rules  is  intended  to  be  unproblematically  valid 
when  it  comes  to  relevant  entailment.  The  first  batch  (the  negation  condi- 
tions) seem  unproblematic  if  negation  is  truth  functional.  The  second  batch 
(the  distribution  conditions,  together  with  the  associativity,  commutativity 
and  idempotence  of  both  disjunction  and  conjunction)  are  sometimes  ques- 
tioned29 but  we  have  been  given  no  reason  yet  to  quibble  with  these  as  rel- 
evant entailments.  Finally,  the  steps  to  break  down  entailments  from  dis- 
junctions and  entailments  to  disjunction  are  fundamental  to  the  behaviour  of 
conjunction  and  disjunction  as  lattice  connectives.  They  are  also  fundamen- 
tal to  inferential  properties  of  these  connectives.  A  V  B  licences  an  inference 
to  C  (and  a  relevant  one,  presumably!)  if  and  only  if  A  and  B  both  licence  that 
inference.  B  A  C  follows  from  A  (and  relevantly  presumably!)  if  and  only  if  B 
and  C  both  follow  from  A. 

The  result  of  the  completed  transformation  will  then  be  a  collection  of 
primitive  entailments:  each  of  which  is  a  conjunction  of  atoms  and  negated 
atoms  in  the  antecedent,  and  a  disjunction  of  atoms  and  negated  atoms  in 
the  consequent.  Here  are  some  examples  of  primitive  entailments: 

pA-p^qV-q       p^pV~p  pA-pA-qAr-isV-sVqV-r 

Anderson  and  Belnap's  criterion  for  deciding  a  primitive  entailment  is  simple. 
A  primitive  entailment  A  — >  B  is  valid  if  and  only  if  one  of  the  conjuncts  in  the 
antecedent  also  features  as  a  disjunct  in  the  consequent.  If  there  is  such  an 
atom,  clearly  the  consequent  follows  from  the  antecedent.  If  there  is  no  such 

27 We  also  lean  on  the  fact  that  we  can  replace  provable  equivalents  ad  libitum  in  formulas. 
Formally,  if  we  can  prove  A  — *  B  and  B  — >  A  then  we  can  prove  C  -)  C'  and  C'  — »  C,  where 
C '  results  from  C  by  changing  as  many  instances  of  A  to  B  in  C  as  you  please.  All  substructural 
logics  satisfy  this  condition. 

28Together  with  the  associativity,  commutativity  and  idempotence  of  both  disjunction  and 
conjunction,  which  I  will  not  bother  to  write  out  formally. 

29We  will  see  later  that  linear  logic  rejects  the  distribution  of  conjunction  over  disjunction. 
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atom,  the  consequent  may  well  be  true  (and  perhaps  even  necessarily  so,  if  an 
atom  and  its  negation  both  appear  as  disjuncts)  but  its  truth  does  not  follow 
from  the  truth  of  the  antecedent.  This  makes  some  kind  of  sense:  what  is  it 
for  the  consequent  to  be  true?  It's  for  Bi  or  B2  or  B3  . . .  to  be  true.  (And  that's 
all,  as  that's  all  that  the  consequent  says.)  If  none  of  these  things  are  given 
by  the  antecedent,  then  the  consequent  as  a  whole  doesn't  follow  from  the 
antecedent  either.30 

We  can  then  decide  an  arbitrary  first  degree  entailment  by  this  reduction 
process.  Given  an  entailment,  reduce  it  to  a  collection  of  primitive  entail- 
ments, and  then  the  original  entailment  is  valid  if  and  only  if  each  of  the  prim- 
itive entailments  is  valid.  Let's  apply  this  to  the  inference  of  disjunctive  syllo- 
gism: (A  V  B)  A  ~A  — >  B.  Distributing  the  disjunction  over  the  conjunction  in 
the  antecedent,  we  get  (A  A  ~A)  V(BA  ~A)  — >  B.  This  is  a  valid  entailment  if 
and  only  if  A  A  ~A  — >  B  and  BA-A^B  both  are.  The  second  is,  but  the  first 
is  not.  Disjunctive  syllogism  is  therefore  rejected  by  Anderson  and  Belnap.  To 
accept  it  as  a  valid  entailment  is  to  accept  A  A  ~A  — >  B  as  valid.  Since  this  is  a 
fallacy  of  relevance,  so  is  disjunctive  syllogism. 

This  is  one  simple  characterisation  of  first  degree  entailments.  Once  we 
start  looking  at  models  we  will  see  some  different  models  for  first  degree  en- 
tailment which  give  us  other  straightforward  characterisations  of  the  first- 
degree  fragment  of  R  and  E.  Now,  however,  we  must  consider  how  to  graft 
this  account  together  with  the  account  of  implicational  logics  we  have  already 
seen. 

2.3.3   Putting  them  together 

To  add  the  truth  functional  connectives  to  a  Hilbert  system  for  R  or  E,  An- 
derson and  Belnap  used  the  axioms  due  to  Ackermann  for  his  system  17'.  The 
conjunction  introduction  and  elimination,  disjunction  introduction  and  elim- 
ination axioms,  together  with  distribution  and  the  rule  of  adjunction  is  suffi- 
cient to  add  the  distributive  lattice  connectives.  To  add  negation,  you  add  the 
double  negation  axioms  and  contraposition,  and  counterexample  (or  equiva- 
lently  reductio) .  Adding  the  truth  functions  to  a  Hilbert  system  is  straightfor- 
ward. 

It  is  more  interesting  to  see  how  to  add  the  connectives  to  the  natural  de- 
duction system,  because  these  systems  usually  afford  a  degree  of  separation 
between  different  connectives,  and  they  provide  a  context  in  which  you  can 
see  the  distinctive  behaviour  of  those  connectives.  Let's  start  with  negation. 
Here  are  the  negation  rules  proposed  by  Anderson  and  Belnap: 

o  (~I)  From  ~Aa  proved  under  the  hypothesis  A{lcJ,  deduce  ~Aa_{k}  (if  k  e 
a).  (This  discharges  the  hypothesis.) 

30I  am  not  here  applying  the  fallacious  condition  that  B]  VB2  follows  from  A  if  and  only  if  B 1 
follows  from  A  or  B2  follows  from  A,  which  is  invalid  in  general.  Let  A  be  B 1  V  B2 ,  for  example. 
But  in  that  case  we  note  that  B 1  follows  from  some  disjunct  of  A  and  B2  also  follows  from  other 
disjunct  of  A.  In  the  atomic  case,  A  can  no  longer  be  split  up. 

Classically  the  idea  to  show  the  entailment  p  — *  q  V  ~q  the  idea  would  be  to  import  the 
tautologous  q  V  ~q  into  the  antecedent,  to  get  p  A  (q  V  ~q)  — >  q  V  ~q,  distribute  to  get 
(p  A  q)  V  (p  A  ~q)  — »  q  V  ~q,  and  split  to  get  both  p  A  q  — »  q  V  ~q  (which  is  valid,  by  means 
of  q)  and  p  A  ~q  — >  q  V  ~q  (which  is  valid,  by  means  of  ~q).  With  eyes  of  relevance  there's  no 
reason  to  see  the  appeal  for  importing  q  V  ~q  in  the  first  place. 
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o  (Contraposition)  From  Ba  and  ~Bb  proved  under  the  hypothesis  A{k},  de- 
duce ~AaUb_{k}  (if  k  e  b).  (This  discharges  the  hypothesis.) 

o  ( — E)  From  — Aa  to  deduce  Aa. 

These  rules  follow  directly  the  axioms  of  reductio,  contraposition  and  double 
negation  elimination.  They  are  sufficient  to  derive  all  of  the  desired  negation 
properties  of  E  and  R.  Here,  for  example,  is  a  proof  of  the  reductio  axiom. 


1 

A->~A{1] 

hyp 

2 

hyp 

3 

1  reit 

4 

~A(1,2} 

2-3  ->E 

5 

A{1} 

2-4-1 

6 

(A 

->  ~A)  ->  -A 

1-5  ->I 

The  rules  for  conjunction  are  also  straightforward. 

o  (AEi)  From  A  ABQ  to  deduce  Aa. 

o  (AE2)  From  A  ABQ  to  deduce  BQ. 

o  (AI)  From  Aa  and  Ba  to  deduce  A  A  Ba. 

These  rules  mirror  the  Hilbert  axiom  conditions  (which  make  A  a  lattice  join) . 
The  conjunction  entails  both  conjuncts,  and  the  conjunction  is  the  strongest 
thing  which  entails  both  conjuncts. 

We  do  not  have  a  rule  which  says  that  if  A  depends  on  something  and 
B  depends  on  something  else  then  A  A  B  depends  on  those  things  together, 
because  that  would  allow  us  to  do  too  much.  If  we  did  have  a  connective  (use 
"&"  for  this  connective  for  the  moment)  which  satisfied  the  same  elimination 
clause  as  conjunction,  and  which  satisfied  that  liberal  introduction  rule,  it 
would  allow  us  to  prove  the  positive  paradox  in  the  following  way. 


1 

2 
3 
4 
5 
6 
7 


B{2) 

A&B{1>2} 

A{1,2} 

B^  A{1} 
A^  (B  ->  A) 


hyp 
hyp 
1  reit 

2,3  &E 
4  &E  reit 
2-5  -I 
1-6  ->I 


If  we  have  a  connective  with  the  elimination  rules  of  conjunction  (which  we 
surely  require,  if  that  connective  is  to  be  "and"  in  the  traditional  sense)  then 
the  liberal  rules  are  too  strong.  They  would  allow  us  to  take  vacuous  excur- 
sions through  conjunction  introductions  and  elimination,  picking  up  irrele- 
vant indices  along  the  way. 

No,  the  appropriate  introduction  rule  for  a  conjunction  is  the  restricted 
one  which  requires  that  both  conjuncts  already  have  the  same  relevance  la- 
bel. This,  somewhat  surprisingly,  suffices  to  prove  everything  we  can  prove 
in  the  Hilbert  system.  Here,  for  an  example,  is  the  proof  of  the  conjunction 
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introduction  Hilbert  axiom 


1 

(A^B)A(A^C){1} 

hyp 

2 

A^B{1) 

1  AE 

3 

A  — >  Cm  } 

1  AE 

4 

A{2} 

hyp 

5 

A^B{1! 

2  reit 

6 

B{1,2} 

4,5  ->E 

7 

C{1] 

3  reit 

8 

C{1,2} 

4,7  ->E 

9 

BACn,2] 

6,8  AI 

10 

4-9  -I 

11    (A^B)A(A^O^(A^BAC)    1-10  ^1 


From  these  rules,  using  from  the  de  Morgan  equivalence  between  A  V  B  and 
~(~A  A  ~B)  it  is  possible  to  derive  the  following  two  rules  for  disjunction.31 
Unfortunately  these  rules  essentially  involve  the  conditional.  There  seems  to 
be  no  way  to  isolate  rules  which  involve  disjunction  alone. 

o  (VIi )  From  Aa  to  deduce  A  V  BQ. 
o  (VI2)  From  Ba  to  deduce  A  VBa. 

o  (VE)  From  A  — >  Ca  and  B  — >  Ca  and  from  A  V  Bb  to  deduce  CQUb- 

The  most  disheartening  thing  about  these  rules  for  disjunction  (and  about 
the  natural  deduction  system  itself)  is  that  they  do  not  suffice.  They  do  not 
prove  the  distribution  of  conjunction  over  disjunction.  Anderson  and  Belnap 
had  to  posit  an  extra  rule. 

o  (Dist)  From  A  A  (B  V  C)Q  to  deduce  (A  A  B)  V  Ca 

It  follows  that  this  Fitch- style  proof  theory,  while  useful  for  proving  things 
in  R  or  E,  and  while  giving  some  separation  of  the  distinct  behaviours  of  the 
logical  connectives,  does  not  provide  pure  introduction  and  elimination  rules 
for  each  connective.  For  a  proof  theory  which  does  that,  the  world  would  have 
to  wait  until  the  1970s,  and  for  some  independent  work  of  Grigori  Mine  [195, 
197] 32  and  J.  Michael  Dunn  [78]  ,33 

The  fusion  connective  o  plays  a  minor  role  in  early  work  in  the  Anderson- 
Belnap  tradition.34  They  noted  that  it  has  some  interesting  properties  in  R, 
but  that  the  residuation  connection  fails  in  E  if  we  take  A  o  B  to  be  defined 
as  ~(A  — >  ~B).  Residuation  fails  because  ~(A  — >  ~B)  — >  C  does  not  A  — > 
(B  — >  C)  if  we  cannot  permute  antecedents  of  arbitrary  conditionals.  Since  E 
was  their  focus,  fusion  played  little  role  in  their  early  work.  Later,  with  Dunn's 
development  of  natural  algebraic  semantics,  and  with  the  shift  of  focus  to  R, 
fusion  began  to  play  a  more  central  role. 

The  topic  of  finding  a  natural  proof  theory  for  relevant  implication  —  and 
in  particular,  the  place  of  distribution  in  such  a  proof  theory  —  was  a  re- 
curring theme  in  logical  research  in  this  tradition.  The  problem  is  not  re- 

31See  Anderson  and  Belnap's  Entailment  [10,  §23.2]  for  the  details. 

32Then  in  Russia,  and  now  at  Stanford.  He  publishes  now  under  the  name  "Grigori  Mints" 
33 A  graduate  student  of  Nuel  Belnap's. 

34They  call  o  "fusion"  after  trying  out  names  such  as  "cotenability"  or  "compossibility",  con- 
nected with  the  definition  as  ~( A  — >  ~B). 
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stricted  to  Fitch-style  systems.  Dag  Prawitz's  1965  monograph  Natural  De- 
duction [214],  launched  Gentzen-style  natural  deduction  systems  on  to  cen- 
tre stage.  At  the  end  of  the  book,  Prawitz  remarked  that  modifying  the  rules  of 
his  system  would  give  you  a  system  of  relevant  implication.  Indeed  they  do. 
Almost. 

Rules  in  Prawitz's  system  are  simple.  Proofs  take  the  form  of  a  tree.  Some 
rules  simply  extend  trees  downward,  from  one  conclusion  to  another.  Others, 
take  two  trees  and  join  them  into  a  new  tree  with  a  single  conclusion. 

A  A  B       A  A  B       A    B       A^B  A 


A  B  A  A  B  B 

These  rules  have  as  assumptions  any  undischarged  premises  at  the  top  of  the 
tree.  To  prove  things  on  the  basis  of  no  assumptions,  you  need  to  use  rules 
which  discharge  them.  For  example,  the  implication  introduction  rule  is  of 
this  form: 

[A] 


B 


A^  B 

This  indicates  that  at  the  node  for  B  there  is  a  collection  of  open  assumptions 
A,  and  we  can  derive  A  — >  B,  closing  those  assumptions.  Prawitz  hypoth- 
esised that  if  you  modified  his  rules  to  only  allow  the  discharge  of  assump- 
tions which  were  actually  used  in  a  proof,  as  opposed  to  allowing  vacuous 
discharge  (which  is  required  in  the  proof  of  A  — >  (B  — >  A),  for  example), 
you  would  get  a  system  of  relevant  logic  in  the  style  of  Anderson  and  Belnap. 
Keeping  our  attention  to  implication  alone,  the  answer  is  correct.  His  rule 
modification  gives  us  a  simple  natural  deduction  system  for  R. 

However,  Prawitz's  rules  for  relevant  logic  are  less  straightforward  once  we 
attempt  to  add  conjunction.  If  we  keep  the  rules  as  stated,  then  the  conjunc- 
tion rules  allow  us  to  prove  the  positive  paradox  in  exactly  the  same  way  as  in 
the  case  with  &  in  the  Fitch  system.35 

A2  B1 

 (AI) 

A  A  B 

 (AE) 

A 

(1,-»D 


B  ->  A 

(2,-»I) 


A^  (B  ->  A) 

We  must  do  something  to  the  rule  for  conjunction  introduction  to  ban  this 
proof.  The  required  emendation  is  to  only  allow  conjunction  introduction 
when  the  two  subproofs  have  exactly  the  same  open  assumptions.  A  simi- 
lar emendation  is  required  for  disjunction  elimination.  And  then,  once  those 
patches  are  applied,  it  turns  out  that  distribution  is  no  longer  provable  in  the 
system.  (The  intuitionistic  or  classical  proof  of  distribution  in  Prawitz's  sys- 
tem requires  either  a  weakening  in  of  an  irrelevant  assumption  or  a  banned 

35The  superscripts  and  the  line  numbers  pair  together  assumptions  and  the  points  in  the  proof 
at  which  they  were  discharged. 
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conjunction  or  disjunction  move.)  Prawitz's  system  is  no  friendlier  to  distri- 
bution than  is  Fitch's. 

Logics  without  distribution,  such  as  linear  logic  are  popular,  in  part,  be- 
cause of  the  difficulty  of  presenting  straightforward  proof  systems  for  logics 
with  distribution.  In  general,  proof  theories  seem  more  natural  or  straightfor- 
ward doing  without  it.  The  absence  of  distribution  has  also  sparked  debate. 
The  naturalness  or  otherwise  of  a  proof  theory  is  no  argument  in  and  of  itself 
for  the  failure  of  distribution.  See  Belnap's  "Life  in  the  Undistributed  Mid- 
dle" [32]  for  more  on  this  point. 

2.3.4  Embeddings 

One  of  the  most  beautiful  results  of  early  work  on  relevant  logic  is  the  embed- 
ding results  showing  how  intuitionistic  logic,  classical  logic  and  S4  find  their 
home  inside  R  and  E  [7,  176,  180].  The  idea  is  that  we  can  move  to  an  irrele- 
vant conditional  by  recognising  that  they  might  be  enthymemes.  When  I  say 
that  A  D  B  holds  (d  is  the  intuitionistic  conditional),  I  am  not  saying  that 
B  follows  from  A,  I  am  saying  that  B  follows  from  A  together  perhaps  with 
some  truth  or  other.  One  simple  way  to  say  this  is  to  lean  on  the  addition  of 
the  Ackermann  constant  t.  We  can  easily  add  t  to  R  by  way  of  the  following 
equivalences 

A->(t->A]  (t->A)->A 

These  state  that  a  claim  is  true  just  in  case  it  follows  from  t.36  Given  t  we  can 
define  the  enthymematic  conditional  A  d  B  as  follows.  A  d  B  is 

AAt^B 

which  states  that  B  follows  from  A  together  with  some  truth  or  other.  Now, 
A  d  (B  d  A)  is  provable:  in  fact,  the  stronger  claim  A  — >  (B  D  A)  is  provable, 
since  it  follows  directly  from  the  axiom  Ad  (t  d  A).  But  this  is  no  longer 
paradoxical,  since  B  d  A  does  not  state  that  A  follows  from  B.  (The  proof  that 
you  get  precisely  intuitionistic  logic  through  this  embedding  is  a  little  trickier 
than  it  might  seem.  You  need  to  revisit  the  definition  of  intuitionistic  negation 
(write  it  for  the  moment)  in  order  to  show  that  A  A  -A  d  B  holds.37  The 
subtleties  are  to  be  found  in  a  paper  by  Meyer  [180] .) 

The  same  kind  of  process  will  help  us  embed  the  strict  conditional  of  S4 
into  E.  In  E,  t  is  not  only  true  but  necessary  (as  the  necessary  propositions 
are  those  entailed  by  t)  so  the  effect  of  the  enthymematic  definition  in  E  is  to 
get  a  strict  conditional.  The  result  is  the  strict  conditional  of  E.  If  we  define 
A  =#>  B  as  A  A  t  — >  B  in  E,  then  the  A,  V,  =#>  fragment  of  E  is  exactly  the  A,  V, 
fragment  of  S4  [11,  §35]. 

We  can  extend  the  modelling  of  intuitionistic  logic  into  E  if  we  step  further 
afield.  We  require  not  only  the  propositional  atom  t,  but  some  more  machin- 
ery: the  machinery  of  propositional  quantification.  If  we  add  propositional 

36The  first  axiom  here  is  too  strong  to  govern  t  in  the  logic  E,  in  which  case  we  replace  it  by 
the  permuted  form  t  — *  (A  — »  A).  The  claim  t  doesn't  entail  all  truths.  (If  it  did,  then  all  truths 
would  be  provable,  since  t  is  provable.)  Instead,  t  entails  all  identities. 

37You  can't  just  use  the  negation  of  relevant  logic,  because  of  course  we  get  A  D  B  V  ~B,  since 
t->BV~B. 
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quantifiers  Vp  and  3p  to  E38  then  intuitionistic  and  strict  implication  are  de- 
fined as  follows: 

AdB    =df    3p(p  A  (p  A  A  — >  B)) 
A^B    =df    3p (Dp  A  (p  A  A  — >  B ) ) 

An  intuitionistic  conditional  asserts  that  there  is  some  truth,  such  that  it  con- 
joined with  A  entails  B.  A  strict  conditional  asserts  that  there  is  some  neces- 
sary truth,  such  that  it  conjoined  with  A  entails  B. 

Embedding  the  classical  conditional  into  relevant  logic  is  also  possible. 
The  emendation  is  that  not  only  do  we  need  to  show  that  weakening  is  pos- 
sible, but  contradictions  must  entail  everything:  and  we  want  to  attempt  this 
without  introducing  a  new  negation.  The  innovation  comes  from  noticing 
that  we  can  dualise  the  enthymematic  construction.  Instead  of  just  requiring 
an  extra  truth  as  a  conjunct  in  the  antecedent,  we  can  admit  an  extra  false- 
hood as  a  disjunct  in  the  consequent.  The  classical  conditional  (also  written 
"A  d  B")  can  be  defined  like  this 

AAt-s  BVf 

where  f  =  ~t.  Now  we  will  get  AA-AdB  since  A  A  -A  — >  f.39  Anderson  and 
Belnap  make  some  sport  of  material  "implication"  on  the  basis  of  this  defini- 
tion. Note  that  constructive  implication  is  still  genuinely  an  implication  with 
the  consequent  being  what  we  expect  to  conclude.  A  "material"  implication 
is  genuinely  an  implication,  but  you  cannot  conclude  with  the  consequent  of 
the  original  "conditional."  No,  you  can  only  conclude  the  consequent  with  a 
disjoined  Vf.40 

Arguments  about  disjoined  fs  lead  quite  well  into  arguments  over  the  law 
of  disjunctive  syllogism,  and  these  are  the  focus  of  our  next  section. 

2.4   Disjunctive  Syllogism 

We  have  already  seen  that  Ackermann's  system  FT '  differs  from  Anderson  and 
Belnap's  system  E  by  the  presence  of  the  rule  (y).  In  Ackermann's  TV,  we  can 
directly  infer  B  from  the  premises  A  V  B  and  ~A.  In  E,  this  is  not  possible:  for 
E  a  rule  of  inference  from  X  to  B  is  admitted  only  when  there  is  some  corre- 
sponding entailment  from  X  (or  the  conjunction  of  formulas  in  X)  to  A.  As 
disjunctive  syllogism  in  an  entailment 

(AVB)A^A^B 

is  not  present,  Anderson  and  Belnap  decided  to  do  without  the  rule  too.  This 
motivates  a  question.  Does  dropping  the  rule  [y)  change  the  set  of  theorems? 

38And  the  proof  theory  for  prepositional  quantifiers  is  not  difficult  [11,  §30-32]. 

39The  result  can  be  extended  to  the  embed  the  whole  of  S4  into  E  (rather  than  only  its  positive 
fragment  of  S4)  by  setting  A     B  =df  Bp  (Dp  A(pAA-)  B  V~p)). 

40I  suspect  that  the  situation  is  not  quite  so  bad  for  material  implication.  If  one  treats  accep- 
tance and  rejection,  assertion  and  denial  with  equal  priority,  and  if  you  take  the  role  of  implica- 
tion as  not  only  warranting  the  acceptance  of  the  consequent,  given  the  acceptance  of  the  an- 
tecedent but  also  the  rejection  of  the  antecedent  on  the  basis  of  the  rejection  of  the  consequent, 
then  the  enthymematic  definition  of  the  material  conditional  seems  not  so  bad  [236]. 
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Is  there  anything  you  can  prove  with  (y)  that  you  cannot  prove  without  it?  Of 
course  there  are  things  you  can  prove  from  from  hypotheses,  using  (y)  which 
cannot  be  proved  without  it.  In  Ackermann's  system  IT'  there  is  a  straightfor- 
ward proof  for  A,  ~A  B.  In  Anderson  and  Belnap's  E  there  is  no  such  proof. 
However,  this  leaves  the  special  case  of  proofs  from  no  hypotheses.  Is  it  the 
case  that  in  E,  if  h  A  V  B  and  I — A  that  h  B  too?  This  is  the  question  of  the 
admissibility  of  disjunctive  syllogism.  If  disjunctive  syllogism  is  admissible  in 
E  then  its  theorems  do  not  differ  from  the  theorems  of  Ackermann's  IT'. 

2.4.1  A  Proof  of  the  Admissibility  of  Disjunctive  Syllogism 
There  are  four  different  proofs  of  the  admissibility  of  disjunctive  syllogism  for 
logics  such  as  E  and  R.  The  first  three  proofs  [184,  241,  170]  are  due  to  Meyer 
(with  help  from  Dunn  on  the  first,  and  help  from  Routley  on  the  second). 
They  all  depend  on  the  same  first  step,  which  we  will  describe  here  as  the  way 
up  lemma.  The  last  proof  was  obtained  by  Kripke  in  1978  [92].  In  this  section  I 
will  sketch  the  third  of  Meyer's  proofs,  because  it  will  illustrate  two  techniques 
which  have  proved  fruitful  in  the  study  of  relevant  and  substructural  logics.  It 
is  worth  examining  this  result  in  some  detail  because  it  shows  some  of  the 
distinctive  techniques  in  the  metatheory  of  relevant  logics. 

Fact  3  (Disjunctive  Syllogism  is  Admissible  in  E  and  R)  In  both  E  andR, 
ifr  A  V  B  and\-  ~A  then h  B. 

To  present  the  bare  bones  of  the  proof  of  this  result,  we  need  some  definitions. 


Definition  4  (Theories)  A  set  T  of  formulas  is  a  theory  if  whenever  A,  B  e  T 
then  A  A  B  e  T,  and  if  A  h  B  then  if  A  e  T  we  also  have  B  e  T.  Theories  are 
closed  under  conjunction  and  provable  consequence. 

Note  that  theories  in  relevant  logics  are  rather  special.  Nonempty  theories  in 
irrelevant logics  contain  all  theorems,  since  if  A  e  T  and  if  B  is  a  theorem  then 
so  is  A  — >  B  in  an  irrelevant  logic.  In  relevant  logics  this  is  not  the  case,  so 
theories  need  not  contain  all  theorems. 

Furthermore,  since  A  A  -A  — >  B  is  not  a  theorem  of  relevant  logics,  the- 
ories may  be  inconsistent  without  being  trivial.  A  theory  might  contain  an 
inconsistent  pair  A  and  ~A,  and  contain  its  logical  consequences,  without 
the  theory  containing  every  formula  whatsoever. 

Finally,  consistent  and  complete  theories  in  classical  propositional  logic 
respect  all  logical  connectives.  In  particular,  if  A  V  B  is  a  member  of  a  consis- 
tent and  complete  theory,  then  one  of  A  and  B  is  a  member  of  that  theory.  For 
if  neither  are,  then  -A  and  ~B  are  members  of  the  theory,  and  so  is  ~(A  V  B) 
(by  logical  consequence)  contradicting  A  V  B's  membership  of  the  theory.  In 
a  logic  like  R  or  E  it  is  quite  possible  for  A  V  B  and  ~(A  V  B)  to  be  members 
of  our  theory  without  the  theory  becoming  trivial.  A  theory  can  be  complete 
without  respecting  disjunction.  It  turns  out  that  theories  which  respect  dis- 
junction play  a  very  important  role,  not  only  in  our  proof  of  the  admissibility 
of  disjunctive  syllogism,  but  also  in  the  theory  of  models  for  substructural 
logics.  So,  they  deserve  their  own  definition. 
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Definition  5  (Special  Theories)  A  theory  T  is  said  to  be  prime  if  whenever 
A  V  B  e  T  then  either  A  e  T  or  B  e  T.  A  theory  T  is  said  to  be  regular  (with 
respect  to  a  particular  logic)  whenever  it  contains  all  of  the  theorems  of  that 
logic. 

Now  we  can  sketch  the  proof  of  the  admissibility  of  (y) . 

Proof  We  will  argue  by  reductio,  showing  that  there  cannot  be  a  case  where 
A  V  B  and  -A  are  provable  but  B  is  not.  Focus  on  B  first.  If  B  is  not  provable, 
we  will  show  first  that  there  is  a  prime  theory  containing  all  of  the  theorems 
of  the  logic  but  which  still  avoids  B.  This  stage  is  the  Way  Up.  We  may  have 
overshot  our  mark  on  the  Way  Up,  as  a  prime  theory  containing  all  theorems 
will  certainly  be  complete  (as  C  V  ~C  is  a  theorem  in  E  or  R  so  one  of  C  and 
~C  will  be  present  in  our  theory)  but  it  may  not  be  consistent.  If  we  can  have 
a  consistent  complete  prime  theory  containing  all  theorems  but  still  missing 
out  B  we  will  have  found  our  contradiction,  for  since  this  new  theory  contains 
all  theorems,  it  contains  A  V  B  and  ~A.  By  primeness  it  contains  either  A 
or  it  contains  B.  Containing  A  is  ruled  out  since  it  already  contains  -A,  so 
containing  B  is  the  remaining  option.41  So,  the  Way  Down  cuts  down  our 
original  theory  into  a  consistent  and  complete  one.  Given  the  way  up  and  the 
way  down,  we  will  have  our  result.  Disjunctive  syllogism  is  admissible.  □ 

All  that  remains  is  to  prove  both  Way  Up  and  Way  Down  lemmas. 

Fact  6  (Way  Up  Lemma)  lf\f  A,  then  there  is  a  regular  prime  theory  T  such 
that  A  T. 

This  is  a  special  case  of  the  general  pair  extension  theorem,  which  is  so  useful 
in  relevant  and  substructural  logics  that  it  deserves  a  separate  statement  and 
a  sketch  of  its  proof.  To  introduce  this  proof,  we  need  a  new  definition  to  keep 
track  of  formulas  which  are  to  appear  in  our  theory,  and  those  which  are  to 
be  kept  out. 

Definition  7  (I--pairs)  An  ordered  pair  (L,  R)  of  sets  of  formulae  is  said  to  be 
a  \--pair  if  and  only  if  there  are  no  formulas  A] , . . . ,  An  e  LandBi , . . . ,  Bm  e  R 
where  At  A  •  •  •  A  An  h  Bi  V  ■  ■  ■  V  Bm. 

A  helpful  shorthand  will  be  to  write  '/\  Ai  h  V  B,-'  for  the  extended  conjunc- 
tions and  disjunctions.  A  I- -pair  is  represents  a  set  of  formulas  we  wish  to  take 
to  be  true  (those  in  the  left)  and  those  we  wish  to  take  to  be  false  (those  in  the 
right).  The  process  of  constructing  a  prime  theory  will  involve  enumerating 
the  entire  language  and  building  up  a  pair,  taking  as  many  formulas  as  possi- 
ble to  be  true,  but  adding  some  as  false  whenever  we  need  to.  So,  we  say  that 
ah-pair  (L',R')  extends  (L,  R)  ifandonlyifL  D  L'  andR  D  R'.  We  write  this  as 
"(I_,R)  C  (L',R')."  The  end  point  of  this  process  will  be  a  full  pair. 

Definition  8  (Full  I--Pairs)  Ah-pair  (L,  R)  is  a  full\- -pair  ii  andonlyif  LuR 
is  the  entire  language. 

Full  h- pairs  are  important,  as  they  give  us  prime  theories. 

41  Note  here  that  disjunctive  syllogism  was  used  in  the  language  used  to  present  the  proof. 
Much  has  been  made  of  this  in  the  literature  on  the  significance  of  disjunctive  syllogism  [33, 182] . 
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Fact  9  (Prime  Theories  from  Full  I— Pairs)  If  (L,  R)  is  a  full  h  -pair,  L  is  a 
prime  theory. 

Proof  We  need  to  verify  that  L  is  closed  under  consequence  and  conjunc- 
tion, and  that  it  is  prime.  First,  consequence.  Suppose  A  e  L  and  that  Ah  B. 
If  B  ^  L,  then  since  (L,  R)  is  full,  B  e  R.  But  then  Ah  B  contradicts  the  condi- 
tion that  (L,  R)  is  a  h-pair. 

Second,  conjunction.  If  Ai ,  A2  e  x,  then  since  A]  A  A2  h  Ai  A  A2,  and 
(L,  R)  is  a  h-pair,  we  must  have  A]  AA2  ^  y ,  and  since  (L,  R)  is  full,  Ai  AA2  e  L 
as  desired. 

Third,  primeness.  If  A]  V  A2  e  L,  then  if  Ai  and  A2  are  both  not  in  L,  by 
fullness,  they  are  both  in  R,  and  since  A]  V  A2  h  A]  V  A2,  we  have  another 
contradiction  to  the  claim  that  (x,  y )  is  a  h-pair.  Hence,  one  of  Ai  and  A2  is  in 
R,  as  we  wished.  □ 


Fact  10  (Pair  Extension  Theorem)  If  I-  is  the  logical  consequence  relation 
of  a  logic  including  all  distributive  lattice  properties,  then  any  h  -pair  (L,  R)  is 
extended  by  some  full  h  -pair  (L ' ,  R ') . 

To  prove  this  theorem,  we  will  assume  that  we  have  enumerated  the  language 
so  that  every  formula  in  the  language  is  in  the  list  Ci ,  C2 , . . . ,  Cn , . . .  We  will  to 
consider  each  formula  one  by  one,  to  check  to  see  whether  we  should  throw  it 
in  L  or  in  R  instead.  We  assume,  in  doing  this,  that  our  language  is  countable.42 

Proof  First  we  show  that  if  (L,  R)  is  a  h-pair,  then  so  is  at  least  one  of  (L  U 
{C},  R)  and  (L,  R  U  {C}),  for  any  formula  C.  Equivalently,  we  show  that  if  (L  U 
{C},  R)  is  not  a  h-pair,  then  the  alternative,  (L,  R  U  {C}),  is.  If  this  were  not  a 
h-pair  either,  then  there  would  be  some  A  e  f\  L  (the  set  of  all  conjunctions 
of  formulae  from  L)  and  B  e  V  R  where  A  h  B  V  C.  Since  (L  U  {C},  R)  is  not  a 
h-pair,  there  are  also  A'  e  /\  L  and  B' s  V/R  such  that  A'  A  C  h  B '.  But  then, 
AAA'  h  BVC.  But  this  means  that  AAA'  h  (BVC)AA'.  Now  by  distributive 
lattice  properties,  we  then  get  AAA'  h  BV  (A' AC).  ButA' AC  h  B',  so  cut, 
and  disjunction  properties  give  us  AAA'  h  BVB',  contrary  to  the  fact  that 
(L,  R)  is  a  h-pair. 

With  that  fact  in  hand,  we  can  create  our  full  pair.  Define  the  series  of 
h-pairs  (Ln,Rn)  as  follows.  Let  (L0,Ro)  =  (L,R),  and  given  (Ln,Rn)  define 
(Ln+1  ,Rn+i)  in  this  way. 

,,       D      N  _  /  (U  U  {Cnj,  Rn>    if  (Ln  U  {Cn},  Rn)  is  a  h-pair, 
(Ln+1)Kn+1>  -  1  (Ln)RnU{Cn})  otherwise. 

Each  (Ln+i ,  Rn+i )  is  a  h-pair  if  its  predecessor  (Ln,  Rn)  is,  for  there  is  always 
a  choice  for  placing  Cn  while  keeping  the  result  a  h-pair.  So,  by  induction 
on  n,  each  (Ln,  Rn)  is  a  h-pair.  It  follows  then  then  (\Jneuj  Ln,  Un£cu  Rn),  the 
limit  of  this  process,  is  also  a  h-pair,  and  it  covers  the  whole  language.  (If 
(Unecu  Ln>  Unea>  Rri)  is  not  a  h-pair,  then  we  have  some  At  e  \J  Ln  and  some 
Bj  e  U  Rn  such  that  Ai  A  •  •  •  A  Ai  h  Bi  V  •  •  •  V  Bm,  but  if  this  is  the  case,  then 
there  is  some  number  n  where  each  At  is  in  Ln  and  each  Bj  is  in  Rn.  It  would 
follow  that  (Ln,  Rn)  is  not  a  h-pair.  So,  we  are  done.  □ 

42The  general  kind  of  proof  works  for  well-ordered  languages  as  well  as  countable  languages. 
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Belnap  proved  the  Pair  Extension  Theorem  in  the  early  1970s.  Dunn  cir- 
culated a  write-up  of  it  in  about  1975,  and  cited  it  in  some  detail  in  1976  [80] . 
Gabbay  independently  used  the  result  for  first-order  intuitionistic  logic,  also 
in  1976  [106].  The  theorem  gives  us  the  Way  Up  Lemma,  because  if  1/  B,  then 
(Th,  {B})  is  a  h -pair,  where  Th  is  the  set  of  theorems.  Then  this  pair  is  extended 
by  a  full  pair,  the  left  part  of  which  is  a  regular  prime  theory,  avoiding  B. 

Now  we  can  complete  our  story  with  the  proof  of  the  Way  Down  Lemma. 

Proof  We  must  move  from  our  regular  prime  theory  T  to  a  consistent  reg- 
ular prime  theory  T*  C  T.  We  need  the  concept  of  a  "metavaluation."  The 
concept  and  its  use  in  proving  the  admissibility  (y)  is  first  found  in  Meyer's 
paper  from  1976  [170].  A  metavaluation  is  a  set  of  formulas  T*  on  formulas 
defined  inductively  on  the  construction  of  formulas  as  follows: 

o  For  a  propositional  atom  p,  p  e  T*  if  and  only  if  p  e  T; 

o  ~A  g  T*  iff  (a)  A  g  T*,  and  (b)  ~A  e  T; 

o  A  A  B  e  T*  iffbothAeT*andBeT*; 

o  AVB  g  T*  iff  either  A  g  T*  or  B  e  T*; 

o  A  ->  B  e  T*  iff  (a)  if  A  g  T*  then  B  e  T*  and  (b)  A  ->  B  g  T. 

Note  the  difference  between  the  clauses  for  the  extensional  connectives  A 
and  V  and  the  intensional  connectives  — >  and  ~.  The  extensional  connectives 
have  one-punch  rules  which  match  their  evaluation  with  respect  to  truth  ta- 
bles. The  intensional  connectives  are  more  complicated.  They  require  both 
that  the  formula  is  in  the  original  theory  and  that  the  extensional  condition 
holds  in  the  new  set  T* . 

We  will  prove  that  T*  is  a  regular  theory.  Its  primeness  and  consistency  axe 
already  delivered  by  fiat,  from  the  clauses  for  V  and  ~.  The  first  step  on  the 
way  is  a  simple  lemma. 

Fact  1 1  (Completeness  Lemma)  If  A  g  T*  then  A  e  T,  and  if  A  g  T*  then 
-A  e  T. 

It  is  simplest  to  prove  both  parts  together  by  induction  on  the  construction 
of  A.  As  an  example,  consider  the  case  for  implication.  The  positive  part 
is  straightforward:  if  A  — >  B  g  T*  then  A  — >  B  g  T  by  fiat.  Now  suppose 
A  ->  B  g  T*.  Then  it  follows  that  either  A— >B^TorAeT*  and  B  g  T*.  In 
the  first  case,  by  the  completeness  of  T,  -(A  — >  B)  g  T  follows  immediately.  In 
the  second  case,  A  g  T*  (so  by  the  induction  hypothesis,  A  g  T)  and  B  ^  T* 
(so  by  the  induction  hypothesis,  ~B  g  T).  Since  A,~B  I — (A  — >  B)  in  both  R 
and  E,  we  have  ~(A  — >  B)  g  T,  as  desired. 

It  is  also  not  too  difficult  to  check  that  T*  is  a  regular  theory.  First,  T* 
is  closed  under  conjunction  (by  the  conjunction  clause)  and  it  is  detached 
(closed  under  modus  ponens,  by  the  implication  clause).  To  show  that  it  is  a 
regular  theory,  then,  it  suffices  to  show  that  every  axiom  of  the  Hilbert  system 
for  R  is  a  member.  To  give  you  an  idea  of  how  it  goes,  I  shall  consider  two 
typical  cases. 

First  we  check  suffixing:  (A  — >  B)  — >  ((B  — >  C)  — >  (A  — >  C)).  Suppose  it 
isn't  in  T*.  Since  it  is  a  theorem  of  the  logic  and  thus  a  member  of  T,  it  satisfies 
the  intensional  condition  and  so  must  fail  to  satisfy  the  extensional  condition. 
So  A  — >  B  g  T*  and  (B  — >  C)  — >  (A  — >  C)  g  T*.  By  the  Completeness  Lemma, 
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then  A  — >  B  e  T,  and  so  by  modus  ponens  from  the  suffixing  axiom  itself,  we 
have  that  (B  ->  C)  ->  (A  ->  C)  g  T.  So  (B  ->  C)  ->  (A  ->  C)  satisfies  the 
intensional  condition,  and  so  must  fail  to  satisfy  the  extensional  condition: 
B  — >  C  g  T*  and  A  — >  C  ^  T* .  By  similar  reasoning  we  derive  that  A  — >  C 
must  finally  fail  to  satisfy  the  extensional  condition,  i.e.  A  e  T*  and  C  ^  T*. 
But  since  of  A  — >  B  e  T*,  B  — >  C  g  T*,  A  g  T*,  by  the  extensional  condition, 
C  g  T* ,  and  we  have  a  contradiction. 

Second,  check  double  negation  elimination:  — A  — >  A.  Suppose  it  isn't  in 
T* .  Again,  since  it's  a  theorem  of  the  logic  and  thus  a  member  of  T,  if  it  fails  it 
must  fail  the  extensional  condition.  So,  — A  g  T*  but  A  ^  T*.  Since  — A  g  T*, 
by  the  negation  clause,  we  have  both  ~A  ^  T*  and  — A  g  T.  From  — A  g  T, 
using  double  negation  elimination,  we  get  A  g  T.  Using  the  negation  clause 
again,  unpacking  ~A  0  T* ,  we  have  either  A  g  T*  or  -A  ^  T.  The  first  possibil- 
ity clashes  with  our  assumption  that  A  ^  T*.  The  second  possibility,  ~A  ^  T 
clashes  again  with  A  ^  T* ,  using  the  Completeness  Lemma. 

The  same  techniques  show  that  each  of  the  other  axioms  are  also  present 
in  T*.  Finally  T*  is  closed  under  modus  ponens,  and  as  a  result,  T*  is  a  com- 
plete, consistent  regular  theory,  and  a  subset  of  T.  This  completes  our  proof 
of  the  Way  Down  Lemma.  □ 

Meyer  pioneered  the  use  of  metavaluations  in  relevant  logic  [178,  181]. 
Metavaluations  were  also  used  be  Kleene  in  his  study  of  intuitionistic  theo- 
ries [143,  144],  who  was  in  turn  inspired  by  Harrop,  who  used  the  technique 
in  the  1950s  to  prove  primeness  for  intuitionistic  logic  [128]. 

There  are  many  different  proofs  of  the  admissibility  of  disjunctive  syllo- 
gism. Meyer  pioneered  the  technique  using  metavaluations,  and  Meyer  and 
Dunn  have  used  other  techniques  [183,  184].  Friedman  and  Meyer  showed 
that  disjunctive  syllogism  fails  in  first-order  relevant  Peano  arithmetic  [103], 
but  that  it  holds  when  you  add  an  infinitary  "omega"  rule.  Meyer  and  I  have 
used  a  different  style  of  metavaluation  argument  to  construct  a  complete 
"true"  relevant  arithmetic  [189].  This  metavaluation  argument  treats  nega- 
tion with  "one  punch"  clause:  -A  g  T*  if  and  only  if  A  ^  T* .  In  this  arithmetic, 
0  =  1  -^0  =  2  is  a  theorem,  as  you  can  deduce  0  =  2  from  0  =  1  by  arithmetic 
means,  while  ~(0  =  2  — >  0  =  1)  is  a  theorem,  as  there  is  no  way,  by  using 
multiplication,  addition  and  identity,  to  deduce  0  =  1  from  0  =  2. 

2.4.2  Interpretation 

A  great  deal  of  the  literature  interpreting  relevant  logics  has  focussed  on  the 
status  of  disjunctive  syllogism.  The  relevantist  of  Belnap  and  Dunn's  essay 
"Entailment  and  Disjunctive  Syllogism"  [33]  is  a  stout  hearted  person  who 
rejects  all  use  of  disjunctive  syllogism.  Belnap  and  Dunn  explain  how  difficult 
it  is  to  maintain  this  line.  Once  you  learn  A  V  B  and  you  learn  ~A,  it  is  indeed 
difficult  to  admit  that  you  have  no  reason  at  all  to  conclude  B.  Stephen  Read 
is  perhaps  the  most  prominent  relevantist  active  today  [223,  224].  Read's  way 
of  resisting  disjunctive  syllogism  is  to  argue  that  in  any  circumstance  in  which 
there  is  pressure  to  conclude  B  from  A  V  B  and  ~A,  we  have  pressure  to  admit 
more  than  A  V  B:  we  have  reason  to  admit  ~A  — >  B,  which  will  licence  the 
conclusion  to  B. 
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Some  proponents  of  relevantists  reject  disjunctive  syllogism  not  merely 
because  it  leads  to  fallacies  of  relevance,  but  because  it  renders  non-trivial 
but  inconsistent  theories  impossible  [186,  238].  The  strong  version  of  this 
view  is  that  inconsistencies  are  not  only  items  of  non-trivial  theories,  they  are 
genuine  possibilities  [217] .  Such  a  view  is  dialetheism,  the  thesis  that  contra- 
dictions are  possible.  Not  all  proponents  of  relevant  logics  are  dialetheists, 
but  dialetheism  has  provided  a  strong  motivation  for  research  into  relevant 
logics,  especially  in  Australia.43 

My  view  on  this  issue  differs  from  both  the  relevantist,  the  dialetheist  and 
the  classicalist  (who  accepts  disjunctive  syllogism,  and  hence  rejects  relevant 
logic)  pluralistic  [22,  233].  Disjunctive  syllogism  is  indeed  inappropriate  to 
apply  to  the  content  of  inconsistent  theories.  However  it  is  impossible  that 
the  premises  of  a  disjunctive  syllogism  be  true  while  if  the  very  same  time 
the  conclusion  is  false.  Relevant  entailment  is  not  the  only  constraint  under 
which  truth  may  be  regulated.  Relevant  entailment  is  one  useful  criterion 
for  evaluating  reasoning,  but  it  is  not  the  only  one.  If  we  are  given  reason  to 
believe  A  V  B  and  reason  to  believe  ~A,  then  (provided  that  these  reasons  to 
do  not  conflict  with  one  another)  we  have  reason  to  believe  B.  This  reason  is 
not  one  licensed  by  relevant  consequence,  but  relevant  consequence  is  not 
the  only  sort  of  licence  to  which  a  good  inference  might  aspire. 

Debate  over  disjunctive  syllogism  has  motivated  interesting  formal  work 
in  relevant  logics.  If  you  take  the  lack  of  disjunctive  syllogism  to  be  a  fault 
in  relevant  logics,  you  can  always  add  a  new  negation  (say,  Boolean  negation, 
written '— ')  which  satisfies  the  axioms  AA— A  — >  B  and  A  — >  BV— B.  Then  rel- 
evant logics  are  truly  systems  of  modal  logic  extending  classical  propositional 
logic  with  two  modal  or  intensional  operators,  ~  (a  one-place  operator)  and 
— >  (a  two-place  operator).  Meyer  and  Routley  have  presented  alternative  ax- 
iomatisations  of  relevant  logics  which  contain  Boolean  negation  '— ',  and  the 
material  conditional  A  D  B  =df  — AVB,  as  the  primary  connective  [191, 192]. 

2.5   Lambek  Calculus 

Lambek  worked  on  his  calculus  to  model  the  behaviour  of  syntactic  and  se- 
mantic types  in  natural  languages.  He  used  technique  from  proof  theory  [149, 
150]  (as  well  as  techniques  from  category  theory  which  we  will  see  later  [151]) . 
His  techniques  built  on  work  of  Bar-Hillel  [15]  and  Ajdukiewicz  [4]  who  in 
turn  formalised  some  insights  of  Husserl. 

The  logical  systems  Lambek  studied  contain  implication  connectives  and 
a  fusion  connective.  Fusion  in  this  language  is  not  commutative,  so  it  nat- 
urally motivates  two  implication  connectives  — >  and  <— .44  We  get  two  arrow 
connectives  because  we  may  residuate  A  o  B  h  C  by  isolating  A  on  the  an- 

43See  the  Australian  entries  in  the  volume  "Paraconsistent  Logic:  Essays  on  the  Inconsis- 
tent" [221],  for  example  [46,  49,  173,  219,  254]. 

44Lambek  wrote  the  two  implication  connectives  as  "/"  for  — >  and  "\"  for  <— ,  and  fusion  as 
concatenation,  but  to  keep  continuity  with  other  sections  I  will  use  the  notation  of  arrows  and 
the  circle  for  fusion. 
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tecedent,  or  equally,  by  isolating  B. 

Ah  B  ->  C 
AoBhC 
B  h  C  <—  A 

If  A  o  B  has  the  same  effect  as  B  o  A,  then  B  — >  C  will  have  the  same  effect  as 
C  <—  A.  If  B  o  A  differs  from  A  o  B  then  so  will  — >  and  <— . 

One  way  to  view  Lambek's  innovation  is  to  see  him  as  to  motivating  and 
develop  a  substructural  logic  in  which  two  implications  have  a  natural  home. 
To  introduce  this  system,  consider  the  problem  of  assigning  types  to  strings 
in  some  language.  We  might  assign  types  to  primitive  expressions  in  the  lan- 
guage, and  explain  how  these  cold  be  combined  to  form  complex  expressions. 
The  result  of  such  a  task  is  a  typing  judgement  of  the  form  x  lh  A,  indicating 
that  the  string  x  has  the  type  A.  Here  are  some  example  typing  judgements. 


John 

lh 

n 

poor 

lh 

n  — >  n 

John  works 

lh 

s 

works 

lh 

s  <—  n 

must  work 

lh 

s  <—  n 

work 

lh 

i 

must 

lh 

i  — >  (s  <—  n) 

John  work 

lh 

n  o  i 

Types  can  be  atomic  or  complex:  they  form  an  algebra  of  formulas  just  like 
those  in  a  propositional  logic.  Here,  the  judgement  "John  lh  n"  says  that  the 
string  John  has  the  type  n  (for  name,  or  noun).  The  next  judgement  says  that 
poor  has  a  special  compound  type  n  — >  n:  it  converts  names  to  names.  It  does 
this  by  composition.  The  string  poor  has  the  property  that  when  you  prefix  it 
to  a  string  of  type  n  you  get  another  (compound)  string  of  type  n.  So,  poor 
John  has  type  n.  So  does  poor  Jean,  and  poor  Joan  of  Arc  (if  Jean  and  Joan 
of  Arc  have  the  requisite  types).45  Strings  can,  of  course,  be  concatenated 
at  the  end  of  other  strings  too.  The  string  works  has  type  s  <—  n  because 
whenever  you  suffix  a  string  of  type  n  with  works  you  get  a  string  of  type  s  (a 
sentence).  John  works, poor  Joan  works  and  poor  poor  Joan  of  Arc  works 
are  all  sentences,  according  to  this  grammar. 

Typing  can  be  nested  arbitrarily.  We  see  that  must  work  has  type  s  <—  n 
(it  acts  like  works).  The  word  work  has  type  i  (intransitive  infinitive)  so  must 
has  type  i  — >  (s  <—  n).  When  you  concatenate  it  in  front  of  any  string  of  type 
i  you  get  a  String  of  type  s  <—  n.  So  must  play  and  must  subscribe  to  New 
Scientist  also  have  type  s  <—  n,  as  play  and  subscribe  to  New  Scientist 
have  type  i. 

Finally,  compositions  have  types  even  if  the  results  do  not  have  a  pred- 
ifined  simple  type.  John  work  at  least  has  the  type  noi,  as  it  is  a  concatenation 
of  a  string  of  type  n  with  a  string  of  type  i.  The  string  must  work  also  has  type 
(i  — >  (s  <—  n))  o  i,  because  it  is  a  composition  of  a  string  of  type  i  — >  (s  <—  n) 

45According  to  this  definition,  poor  poor  John  and  poor  poor  poor  poor  Joan  of  Arc  are 
also  strings  of  type  n. 
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with  a  string  of  type  i.  Clearly  here  fusion  is  not  commutative.  John  work  has 
type  not,  but  work  John  does  not.  As  a  corollary,  — >  and  <—  differ.  Given  the 
associativity  of  concatenation  of  strings,  fusion  is  associative  too.  Any  string 
oftype  Ao(BoC)  is  oftype  (AoB)oC.  We  can  associate  freely  in  any  direction.46 

Once  we  have  a  simple  type  system  like  this,  typing  inferences  are  then 
possible,  on  the  basis  of  the  interactions  of  the  type-constructors  — >,  <—  and 
o.  One  of  Lambek's  innovations  was  to  notice  that  this  type  system  can  be 
manipulated  using  a  simple  Gentzen-style  consecution  calculus.  This  calcu- 
lus manipulates  consecutions  of  the  form  Ai ,  A2, . . . ,  An  h  B.  We  read  this 
consecution  as  asserting  that  any  string  which  is  a  concatenation  of  strings  of 
type  A] ,  A2,  . .  • ,  An  also  has  type  B.47  A  list  of  types  will  be  treated  as  a  type 
in  my  explanations  below.48 

The  system  is  made  up  of  one  axiom  and  a  collection  of  rules.  The  ele- 
mentary type  axiom  is  the  identity. 

Ah  A 

Any  string  of  type  A  is  of  type  A.  The  rules  introduce  type  constructors  on  the 
left  and  the  right  of  the  turnstile.  Here  are  the  rules  for  the  left-to-right  arrow. 

X,  A  h  B  XhA  Y,B,ZhC 

 H  R]   H  L] 

X  h  A  — >  B  Y,A^B,X,ZhC 

If  you  have  any  string  of  type  X  concatenated  with  a  string  of  type  A  is  a  string 
of  type  B,  then  this  means  that  X  is  of  type  A  — >  B.  Conversely,  if  any  string 
of  type  X  is  also  of  type  A,  and  strings  of  type  Y,  B,  Z  are  also  of  type  C,  then 
strings  of  type  Y,  A  — >  B,  X,  Z  are  also  of  type  C.  Why  is  this?  It  is  because 
strings  of  type  A  — >  B,  X  are  also  of  type  B,  because  they  are  concatenations 
of  a  string  of  type  A  — >  B  to  the  left  of  a  string  of  type  X  (which  also  has  type 
A).  The  mirror  image  of  this  reasoning  motivates  the  right-to-left  conditional 
rules: 

A,  X  h  B  XhA    Y  B,  Z  h  C 

  [<-  R]   [<-  L] 

Xh  B  (- A  Y,X,B^A,ZhC 

The  next  rules  make  fusion  the  direct  object  language  correlate  of  the  comma 
in  the  metalanguage. 

XhA    YhB  X,A,B,YhC 

  [oR]   [oL] 

X,  Y  h  A  o  B  X,AoB,YhC 

Proofs  in  this  system  are  trees  with  consecutions  at  the  nodes,  and  whose 
leaves  are  axioms  of  identity.  Each  step  in  the  tree  is  an  instance  of  one  or 
other  of  the  rules.  Here  is  a  proof,  showing  that  the  prefixing  axiom  holds  in 

46We  can  associate  fusion  freely,  not  the  conditionals.  A  — >  (B  — >  C)  is  not  the  same  type  as 
( A  — >  B)  — >  C,  as  you  can  check. 

47Lambek  used  the  same  notation  (an  arrow)  to  stand  ambiguously  for  the  two  relations  we 
mark  with  lh  and  h  respectively. 

48The  list  constructor  is  the  metalinguistic  analogue  of  the  fusion  connective.  Note  too  that 
"metalinguistic"  here  means  the  metalanguage  of  the  type  language,  which  itself  is  a  kind  of  met- 
alanguage of  the  language  of  strings  which  it  types. 
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rule  form.49 

Ah A  BhB 

ChC  A  — >  B,  A  h  B 

 (- 

A^B,C^A,ChB 


A^B,C^AhC^B 


M  L) 
R) 

MR) 


A^BhfC^A)^(C^B) 
Here  is  another  proof,  which  combines  both  implication  connectives. 

Ah A  BhB 


ChC     A  — >  B,  A  h  B 

C,  (A  — >  B)  <-  C,Ah  B 

 (<- 

(A^B)^C.AhB^C 

(A^B)^ChA-i(B^C) 


L) 
L) 

-R) 

R) 


A  proof  system  like  this  has  a  number  of  admirable  properties.  Most  obvious 
is  the  clean  division  of  labour  in  the  rules  for  each  connective.  Each  rule  fea- 
tures only  the  connective  being  introduced,  whether  in  antecedent  (left)  or 
consequent  (right)  position.  Another  admirable  property  is  the  way  that  for- 
mulas appearing  in  the  premises  also  appear  in  the  conclusion  of  a  rule  (ei- 
ther as  entire  formulas  or  as  subformulas  of  other  formulas) .  In  proof  search, 
there  is  no  need  to  go  looking  for  other  intermediate  formulas  in  the  proof  of 
a  consecution.  These  two  facts  prove  simple  conservative  extension  results. 
Adding  o  to  the  logic  of  <—  and  — >  would  result  in  no  more  provable  consecu- 
tions in  the  original  language,  because  a  proof  of  a  consecution  involving  no 
fusions  could  not  involve  any  fusions  at  all. 

All  of  this  would  be  for  naught  if  the  deduction  system  were  incomplete.  If 
it  didn't  match  its  intended  interpretation,  these  beautiful  properties  would 
be  useless.  One  important  step  toward  proving  that  the  deduction  system  is 
complete  is  proving  that  the  cut  rule  is  admissible.  (Recall  that  a  rule  is  admis- 
sible if  whenever  you  can  prove  the  premises  you  can  prove  the  conclusion: 
adding  it  as  an  extra  rule  does  not  increase  the  stock  of  provable  things.) 

XhA    Y,  A,  Z  h  B 

 [CutA] 

Y,X,Zh  B 

This  is  not  a  primitive  rule  in  our  calculus,  because  adding  it  would  destroy 
the  subformula  property,  and  make  proof  search  intolerably  unbounded.  It 
ought  to  be  admissible  because  of  the  intended  interpretation  of  h.  If  X  h  A, 
every  string  of  type  X  is  also  of  type  A.  If  Y,  A,  Z  h  B,  then  every  string  which  is 
a  concatenation  of  a  Y  an  A  and  a  Z  has  type  B.  So,  given  a  concatenation  of 
a  Y  and  an  X  and  a  Z,  this  is  also  a  type  B  since  the  string  of  type  X  is  a  string 
of  type  A.  The  cut  rule  expresses  the  transitivity  of  the  "every  string  of  type  x 
is  of  type  y  "  relation. 

49There  is  no  sense  at  this  point  in  which  some  type  is  a  theorem  of  the  calculus,  so  we  focus 
on  the  consecution  forms  of  axioms,  in  which  the  main  arrow  is  converted  into  a  turnstile. 
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Fact  12  (Cut  is  admissible  in  the  Lambek  calculus)  IfX  h  A  is  provable 
in  the  Lambek  calculus  with  the  aid  of  the  cut  rule,  it  can  also  be  proved  with- 
out it. 

Proof  Lambek's  proof  of  the  cut  admissibility  theorem  parallels  Gentzen's 
own  [111,  112].  You  take  a  proof  featuring  a  cut  and  you  push  that  cut  up- 
wards to  the  top  of  the  tree,  where  it  evaporates.  So,  given  an  instance  of  the 
cut  rule,  if  the  formula  featured  in  the  cut  is  not  introduced  in  the  rules  above 
the  cut,  you  permute  the  cut  with  the  other  rules.  (You  show  that  you  could 
have  done  the  cut  before  applying  the  other  rule,  instead  of  after.)  Once  that 
is  done  as  much  as  possible,  you  have  a  cut  where  the  cut  formula  was  in- 
troduced in  both  premises  of  the  cut.  If  the  formula  is  atomic,  then  the  only 
way  it  was  introduced  was  in  an  axiom,  and  the  instance  of  cut  is  irrelevant 
(it  has  evaporated:  cutting  Y,  A,  Z  h  B  with  A  h  A  gives  us  just  Y,  A,  Z  h  B).  If 
the  formula  is  not  atomic,  you  show  that  you  could  trade  in  the  cut  on  that 
formula  with  cuts  on  smaller  formulas.  Here  is  an  example  cut  on  the  impli- 
cation formula  A  — >  B  introduced  in  both  left  and  right  branches. 

W.AhB  XhA  Y,B,ZhC 

R]   H  L] 


WhA^B  Y,A-»B,X,ZI- C 

  [CutA^B] 

Y,W,X,Zh  C 

We  can  transform  it  so  that  cuts  occur  on  the  subformulas  of  A  — >  B. 

XhA    W,  A  h  B 

[CutA] 


W,XhB  Y.B.ZhC 
 [CutB] 

YW,x,ZhC 

The  cases  for  the  other  formulas  are  just  as  straightforward.  As  formulas  have 
only  finite  complexity,  and  trees  have  only  finite  height,  this  process  termi- 
nates. □ 

The  result  that  cut  is  admissible  gives  us  a  decision  procedure  for  the  calculus. 


Fact  13  (Decidability  of  the  Lambek  Calculus)  The  issue  of  whether  or 
not  a  consecution  XhA  has  a  proof  is  decidable. 

Proof  To  check  if  X  h  A  is  provable,  consider  its  possible  ancestors  in  the 
Gentzen  proof  system.  There  are  only  finitely  many  ancestors,  each  corre- 
sponding to  the  decomposition  of  one  of  the  formulas  inside  the  consecution. 
(The  complex  cases  are  the  implication  left  rules,  which  give  you  the  option 
of  many  different  possible  places  to  split  the  Y  in  the  antecedent  X,  A  — >  B,  Y 
or  Y,  B  <—  A,  X,  and  the  fusion  right  rule,  which  gives  you  the  choice  of  loca- 
tions to  split  X  in  X  h  A  o  B.)  The  possible  ancestors  themselves  are  simpler 
consecutions,  with  fewer  connectives.  Decision  of  consecutions  with  no  con- 
nectives is  trivial  (X  h  p  is  provable  if  and  only  if  X  is  p)  so  we  have  our  algo- 
rithm by  a  recursion.  □ 

This  decision  procedure  for  the  calculus  is  exceedingly  simple.  Gentzen's  pro- 
cedure for  classical  and  intuitionistic  logic  has  to  deal  with  the  structural  rule 
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of  contraction:50 

X(Y,  Y)  h  A 

—  [Wl] 

X(Y)  h  A 

which  states  that  if  a  formula  is  used  twice  in  a  proof,  it  may  as  well  have  been 
used  once.  This  makes  proof  search  chronically  more  difficult,  as  some  kind 
of  limit  must  be  found  on  how  many  consecutions  might  have  appeared  as 
the  premises  of  the  consecution  we  are  trying  to  prove. 

Sometimes  people  refer  to  the  Lambek  calculus  as  a  logic  without  struc- 
tural rules,  but  this  is  not  the  case.  The  Lambek  calculus  presumes  the  as- 
sociativity of  concatenation.  A  proper  generalisation  of  the  calculus  treats 
antecedent  structures  not  as  lists  of  formulas  but  as  more  general  bunches  for 
which  the  comma  is  a  genuine  ordered-pairing  operation.  In  this  case,  the 
antecedent  structure  A,  (B,  C)  is  not  the  same  structure  as  (A,  B),  C.51  Lam- 
bek's  original  calculus  is  properly  called  Lambek's  associative  calculus.  The 
non-associative  calculus  can  no  longer  prove  the  prefixing  consecution.  (Try 
to  follow  through  the  proof  in  the  absence  of  associativity.  It  doesn't  work.) 
Of  course,  given  a  non-associative  calculus,  you  must  modify  the  rules  for  the 
connectives.  Instead  of  the  rules  with  antecedent  X,  A,  Y  h  B  we  can  have 
X(A)  h  B,  where  "X(A)"  indicates  a  structure  with  a  designated  instance  of  A. 
The  rule  for  implication  on  the  left  becomes,  for  example 

XhA  Y(B)hC 

 H  L] 

Y(A  — >  B,  X)  h  C 

Absence  of  structural  rules  also  makes  other  things  fail.  The  structural  rule  of 
contraction  (W)  is  required  for  the  contraction  consecution.52 

Ah A  BhB 

 ML) 

Ah  A      A  — >  B,  A  h  B 

X((Y,Z),Z))hA   ML) 

— l^-^Jl  [w]       ((A  — >  (A->B),A),A)hB 

x(Y,  z)hc   (w) 

A  — >  (A  — >  B),  A  h  B 

R) 


  A-)(A-)B)hA^B 

50You'll  see  that  the  structural  rule  is  stated  in  generality:  contraction  operates  on  arbitrary 
structures,  in  arbitrary  contexts.  This  is  needed  for  the  cut  elimination  process.  If  we  could 
contract  only  whole  formulas,  then  if  we  wanted  to  push  a  cut  past  the  move  from  X(  A,  A)  h  B 
to  X(  A)  h  B,  where  we  are  cutting  with  C,  D  HA,  the  result  would  require  us  to  somehow  get 
fromX((C,D),  (C,D))  h  BtoX(C,D)  I-  B.  We  cannot  do  this  if  cut  operates  only  on  formulas, 
and  if  associativity  or  commutativity  is  absent. 

51  Non-associative  combination  plays  an  important  role  in  general  grammars,  according  to 
Lambek  [150].  The  role  of  some  conversions  such  as  wh-  constructions  (replacing  names  by 
"who",  to  construct  questions,  etc.)  seem  to  require  a  finer  analysis  of  the  phrase  structure  of 
sentences,  and  seem  to  motivate  a  rejection  of  associativity. 

Commutative  composition  may  also  have  a  place  in  linguistic  analysis.  Composition  of  differ- 
ent gestures  in  sign  language  may  run  in  parallel,  with  no  natural  ordering.  This  kind  of  com- 
position might  be  best  modelled  as  distinct  from  the  temporal  ordered  composition  of  different 
sign  units.  In  this  case,  we  have  reason  to  admit  two  forms  of  composition,  a  situation  we  will  see 
more  of  later. 

52Sometimes  you  see  it  claimed  that  (Wl)  is  required  for  the  contraction  consecution,  this  is 
true  in  the  presence  of  associativity,  but  can  fail  outside  that  context.  The  rule  (Wl)  corresponds 
to  the  validity  of  A  A  ( A  — >  B)  h  B.  It  does  not  correspond  to  the  validity  of  any  consecution  in 
the  — >  only  fragment  of  the  language. 
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The  structural  rule  of  weakening  (K)  is  required  for  the  weakening  axiom, 

A  h  A 

X(Y)hC   (K) 


[K]         A,  B  h  A 


X(Y,2)hC  — :  (— »  R) 

Ah  B  ->  A 

and  the  structural  rule  of  permutation  (C)  gives  the  permutation  axiom. 

BhB  ChC 


Ah  A     B  — >  C,  B  h  C 

X(Yi,(Y2,Z))l-D  (A^(B^C),A),BhChL1 

 [C]   (C) 

X(Y2,(Y,,Z))hD  (A  — >  (B  — >  C),B),Ah  C 

 MR) 

A->(B-iC),BhA^C 

R) 


A  — >  (B  — >  C)  h  B  — >  (A  ^  C) 


Finally  (for  this  brief  excursus  into  the  effect  of  structural  rules)  the  mingle 
rule  (M)  has  been  of  interest  to  the  relevant  logic  community.  It  is  the  con- 
verse of  Wl  contraction,  and  a  special  instance  of  weakening  (K).  It  corre- 
sponds to  the  mingle  consecution  A  h  A  — >  A,  whose  addition  to  R  results 
in  the  well-behaved  system  RM.  We  will  consider  models  of  RM  in  the  next 
section. 

X(Y)hC   (M) 

 [M]        A,  A  h  A 

X(Y,Y)hC   !  (— >  R) 

A  h  A  — >  A 

There  are  many  different  structural  rules  which  feature  in  different  logics  for 
different  purposes.  Table  1  contains  some  prominent  structural  rules.  I  use 
the  notation  X  <=  Y  to  stand  for  the  structural  rule 

Z(X)  h  A 

Z(Y)  h  A 

You  can  replace  Y  by  X  (reading  the  proof  upwards)  in  any  context  in  an  an- 
tecedent. 

This  proliferation  of  options  concerning  structural  rules  leaves  us  with  the 
issue  of  how  to  choose  between  them.  In  some  cases,  such  as  Lambek's  anal- 
ysis of  typing  regimes  on  languages,  the  domain  is  explicit  enough  for  the 
appropriate  structural  rules  to  be  "read  off"  the  objects  being  modelled.  In 
the  case  of  finding  an  appropriate  logic  of  entailment,  the  question  is  more 
fraught.  Anderson  and  Belnap's  considerations  in  favour  of  the  logic  E  are 
by  no  means  the  only  choices  available  for  a  relevantist.  Richard  Sylvan's 
depth  relevant  program  [139,  242]  and  Brady's  constraints  of  concept  contain- 
ment [45, 48]  motivate  logics  much  weaker  than  E.  They  motivate  logics  with- 
out weakening,  commutativity,  associativity  and  contraction. 

Let's  return  to  Lambek,  after  that  excursus  on  structural  rules.  In  one  of 
his  early  papers,  Lambek  considered  adding  conjunction  to  his  calculus  with 
these  rules  [150]. 

XhA    XhB  X(A)hC  X(B)hC 

 [AR]   [AL,]   [AL2] 

XhAAB  X(AAB)hC  X(AAB)hC 
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Label 

Rule 

A  c  c  n  p  i  a  t  i  ^  ri  t~\  7" 

R 

v= 

fX  Yl  7 

TwistpH  A<!<5nrifltivitv 

IVVlOlvU  1  lijijUVjlClLlVliy 

B' 

X  (Y  7) 

\ — 

(Y  XI  7 
l 1  >  AJ  >  *- 

Converse  Associativity 

Bc 

(X,Y),Z 

X,(Y,Z) 

Strong  Commutativity 

C 

(X,Y),Z 

(X,Z),Y 

Weak  Commutativity 

CI 

X,Y 

4= 

Y,X 

Strong  Contraction 

w 

(X,Y),Y 

4= 

X,Y 

Weak  Contraction 

Wl 

x,x 

4= 

X 

Mingle 

M 

X 

4= 

x,x 

Weakening 

K 

X 

4= 

X,Y 

Commuted  Weakening 

K' 

X 

4= 

Y,X 

Table  1:  Structural  Rules 


Adding  disjunction  with  dual  rules  is  also  straightforward. 

XhA  XhB  X(A)hC  X(B)hC 

[VR,]   [VR2]   [AR] 


XhAVB  XhAVB  X(AVB)hC 

Conjunctive  and  disjunctive  types  have  clear  interpretations  in  the  calculus  of 
syntactic  types.  In  English,  and  is  promiscuous.  It  conjoins  sentences,  names, 
verbs,  and  other  things.  It  makes  sense  to  say  that  it  has  a  conjunctive  type 

and  lh  ((ai  — >  di )  <—  di )  A  •  •  •  A  ((an  — )  an)  <—  an) 

for  n  types  at.53  Similarly,  disjunctive  types  have  a  simple  interpretation. 

o  xlh  A  AB  ifandonlyifx  lh  Aandxlh  B. 

o  xlh  A  VB  ifandonlyifx  lh  A  or  xlh  B. 

Lambek's  rules  for  conjunction  and  disjunction  are  satisfied  under  this  inter- 
pretation of  their  behaviour.  Lambek's  rules  are  sound  for  this  interpretation. 

Cut  is  still  admissible  with  the  addition  of  these  rules.  It  is  straightforward 
to  permute  cuts  past  these  rules,  and  to  eliminate  conjunctions  introduced 
simultaneously  by  both.  However,  the  addition  results  in  the  failure  of  distri- 
bution. The  traditional  proof  of  distribution  (in  Figure  2)  requires  both  con- 
traction and  weakening.  This  means  that  the  simple  rules  for  conjunction 
and  disjunction  (in  the  context  of  this  proof  theory,  including  its  structural 
rules)  are  incomplete  for  the  intended  interpretation. 

2. 6   Kripke's  Decidability  Technique  for  R[— > ,  A] 

Lambek's  proof  theory  for  the  calculus  of  syntactic  types  has  a  close  cousin, 
for  the  relevant  logic  R.  Within  a  year  of  Lambek's  publication  of  his  calculus 

53 Actually  it  makes  sense  to  think  of  and  as  having  type  Vp(  (p  — >  p)  <—  p).  However,  proposi- 
tionally  quantified  Lambek  calculus  is  a  wide-open  field.  No-one  that  I  know  of  has  explored  this 
topic,  at  the  time  of  writing. 
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Ah A  BhB  Ah A  ChC 

 (K)   (K)   (K)   (K) 

A,  BhB  A,  BhB  A,  ChC  A,  ChC 

 (AR)   (AR) 

A,  B  h  A  A  B  A,  C  h  A  A  C 

(VR)  — r— — —   . . — — rr  (VR) 


A,  B  h  (A  A  B)  V  (A  A  C)  A,  C  h  (A  A  B)  V  (A  A  C) 

 (VL) 

A,  B  V  C  h  (AAB)  V(AAC) 

(AL) 


A,AA(BVC)h(AAB)V(AAC) 

 (AL) 

A  A  (B  V  C),  A  A  (B  V  C)  h  (A  A  B)  V  (A  A  C) 

 (Wl) 

AA(BVC)  h  (AAB)  V(AAC) 

Figure  2:  Proof  of  Distribution  of  A  over  V 


of  types,  Saul  Kripke  published  a  decidability  result  using  a  similar  Gentzen 
system  for  the  implication  fragments  of  the  relevant  logics  R  and  E  [147]. 
Kripke's  results  extend  without  much  modification  to  the  implication  and 
conjunction  fragments  of  these  logics,  and  less  straightforwardly  to  the  im- 
plication, negation  fragment  [35,  27,  10]  or  to  the  whole  logic  without  dis- 
tribution [174]  (Meyer  christened  the  resulting  logic  LR  for  lattice  R).  I  will 
sketch  the  decidability  argument  for  the  implication  and  conjunction  frag- 
ment R[— >,  A],  and  then  show  how  LR  can  be  embedded  within  R[— >,  A],  ren- 
dering it  decidable  as  well. 

The  technique  uses  the  Gentzen  proof  system  for  R[— >,  A],  which  is  a  ver- 
sion of  the  Gentzen  systems  seen  in  the  previous  section.  It  uses  the  the  same 
rules  for  — >  and  A,  and  it  is  modified  to  make  it  model  the  logic  R.  We  have 
the  structural  rules  of  associativity  and  commutativity  (which  we  henceforth 
ignore,  taking  antecedents  of  consecutions  to  be  multisets  of  formulas) .  We 
add  also  the  structural  rule  Wl  of  contraction.  Cut  is  eliminable  from  this  sys- 
tem, using  standard  techniques.  However,  the  decidability  of  the  system  is 
not  straightforward,  given  the  presence  of  the  rule  Wl.  Wl  makes  proof-search 
fiendishly  difficult.  The  main  strategy  of  the  decision  procedure  for  R[— >,  A] 
is  to  limit  applications  Wl  in  order  to  prevent  a  proof  search  from  running 
on  forever  in  the  following  way:  "Is  p  h  q  derivable?  Well  it  is  if  p ,  p  h  q  is 
derivable.  Is  p ,  p  h  q  derivable?  Well  it  is  if  p ,  p ,  p  h  q  is  . . . " 

We  need  one  simple  notion  before  this  strategy  can  be  explained.  We  will 
say  that  the  consecution  X'  h  A  is  a  contraction  of  X  h  A  just  in  case  X'  h  A 
can  be  derived  from  X  h  A  by  (repeated)  applications  of  the  the  structural 
rules.  (This  means  contraction,  in  effect,  if  you  take  the  structures  X  and  X'  to 
be  multisets,  identifying  different  permutations  and  associations  of  the  for- 
mulas therein.)  Kripke's  plan  is  to  drop  the  Wl,  replacing  it  by  building  into 
the  connective  rules  a  limited  amount  of  contraction. 

More  precisely,  the  idea  is  to  allow  a  contraction  of  the  conclusion  of  an 
connective  rule  only  in  so  far  as  the  same  result  could  not  be  obtained  by  first 
contracting  the  premises.  A  little  thought  shows  that  this  means  no  change 
for  the  rules  (— >  R),  (AL)  and  (AR),  and  that  the  following  is  what  is  needed 
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suffice  to  modify  (— >  L). 

XhA    Y,  B  h  C 

7  ;  [-* L'] 

[X,  Y,  A  — >  B]  h  C 

where  [X,  Y,  A  — >  B]  is  any  contraction  of  X,  Y,  A  — >  B  such  that 

o  A  — >  B  occurs  only  0,  1,  or  2  times  fewer  than  in  X,  Y,  A  — >  B; 

o  Any  formula  other  than  A  — >  B  occurs  only  0  or  1  time  fewer. 

It  is  clear  that  after  modifying  the  system  R[— >,  A]  by  building  some  limited 
contraction  into  (— >  L')  in  the  manner  just  discussed,  the  following  lemma  is 
provable  by  an  induction  on  length  of  derivations: 

Lemma  14  (Curry's  Lemma)  IfaconsecutionX'  h  A  is  a  contraction  of  a  con- 
secution XhA  and  XhA  has  a  derivation  of  length  n,  then  X'  h  A  has  a 
derivation  of  length  no  greater  than  n.54 

This  shows  that  the  modification  of  the  system  leaves  the  same  consecutions 
derivable  (since  the  lemma  shows  that  the  effect  of  contraction  is  retained). 
For  the  rest  of  this  section  we  will  work  in  the  modified  proof  system. 

Curry's  Lemma  also  has  the  corollary  that  every  derivable  consecution  has 
an  irredundant  derivation:  that  is,  a  proof  containing  no  branch  with  a  con- 
secution X'  h  A  below  a  sequent  X  h  A  of  which  it  is  a  contraction. 

Now  we  can  describe  the  decision  procedure.  Given  a  consecution  XhA, 
you  test  for  provability  by  building  a  proof  search  tree:  you  place  above  XhA 
all  possible  premises  or  pairs  of  premises  from  which  XhA  follows  by  one 
of  the  rules.  Even  though  we  have  built  some  contraction  into  one  rule,  this 
will  be  only  a  finite  number  of  consecutions.  This  gives  a  tree.  If  a  proof  of 
the  consecution  exists,  it  will  be  formed  as  a  subtree  of  this  proof  search  tree. 
By  Curry's  Lemma,  the  proof  search  tree  can  be  made  irredundant.  The  tree 
is  also  finite,  by  the  following  lemma. 

Lemma  15  (Konig's  Lemma)  A  tree  with  finitely  branching  tree  with  branches 
of  finite  length  is  itself  finite. 

We  have  already  proved  that  the  tree  is  finitely  branching  (each  consecution 
can  have  only  finitely  many  possible  ancestors) .  The  question  of  the  length  of 
the  branches  remains  open,  and  this  is  where  an  Kripke  proved  an  important 
lemma.  To  state  it  we  need  an  idea  from  Kleene.  Two  consecutions  XhA  and 
X'  h  A  are  cognate  just  when  exactly  the  same  formulas  X  as  in  X'.  The  class 
of  all  consecutions  cognate  to  a  given  consecution  is  called  a  cognation  class. 
Now  we  can  state  and  prove  Kripke's  lemma. 

Lemma  16  (Kripke's  Lemma)  There  is  no  infinite  sequence  of  cognate  conse- 
cutions such  that  no  earlier  consecution  is  a  contraction  of  a  later  consecution 
in  the  sequence. 

54The  name  comes  from  Anderson  and  Belnap  [10],  who  note  that  it  is  a  modification  of  a 
lemma  due  to  Curry  [61],  applicable  to  classical  and  intuitionistic  Gentzen  systems. 
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This  means  that  the  number  of  cognation  classes  occurring  in  any  derivation 
(and  hence  in  each  branch)  is  finite.  But  Kripke's  Lemma  also  shows  that  only 
a  finite  number  of  members  of  each  cognation  class  occur  in  a  branch  (this 
is  because  we  have  constructed  the  complete  proof  search  tree  to  be  irredun- 
dant) .  So  every  branch  is  finite,  and  so  both  conditions  of  Konig's  Lemma 
hold.  It  follows  that  the  complete  proof  search  tree  is  finite  and  so  there  is  a 
decision  procedure.  So,  a  proof  of  Kripke's  Lemma  concludes  our  search  for  a 
decision  procedure  for  R[A,  — >]. 

Proof  This  is  not  a  complete  proof  of  Kripke's  Lemma.  (The  literature  con- 
tains some  clear  expositions  [10,35].)  The  kernel  idea  can  be  seen  in  a  picture. 
As  a  special  case,  consider  consecutions  cognate  to  X,  Y  h  A.  Each  such  con- 
secution can  be  depicted  as  a  point  in  the  upper  right-hand  quadrant  of  the 
plane,  marked  with  the  origin  at  (1 , 1 )  rather  than  (0, 0)  since  X,  Y  h  A  is  the 
minimal  consecution  in  the  cognation  class.  So,  X,X,Y,  Y,  Y,  Y  h  A  is  repre- 
sented as  (2,4):  '2  X  units'  and  '4  Y  units'.  Now  given  any  initial  consecution, 
for  example 

(r0)  X,X,X,Y,YhA 

you  might  try  to  build  an  irredundant  sequence  by  first  inflating  the  number 
of  Ys  (for  purposes  of  keeping  on  the  page  we  let  this  be  to  5  rather  than  3088) . 
But  then,  you  have  to  decrement  number  of  Xs  at  least  by  one.  The  result  is 
depicted  in  Figure  3  for  the  first  two  members  of  the  sequence  F0 ,  H . 


1  2        3        4        5        6  7 


Figure  3:  Descending  Regions 

The  purpose  of  the  intersecting  lines  at  each  point  is  to  mark  off  areas 
(shaded  in  the  diagram)  into  which  no  further  points  of  the  sequence  may  be 
placed.  If  T2  were  placed  as  indicated  at  the  point  (6,5),  it  would  reduce  to 
P0 .  This  this  means  that  each  new  point  must  proceed  either  one  unit  closer 
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to  the  X  axis  or  one  unit  closer  to  the  Y  axis.  After  a  finite  number  of  choices 
the  consecutions  will  arrive  at  one  or  other  of  the  two  axes,  and  then  after 
a  time,  you  will  arrive  at  the  other.  At  that  time,  no  more  additions  can  be 
made,  keeping  the  sequence  irredundant. 

This  proof  sketch  generalises  to  n-dimensional  space,  corresponding  to 
an  initial  consecution  with  n  different  antecedent  parts.  The  only  difficulty  is 
in  drawing  the  pictures.55  □ 

Extending  this  result  to  the  whole  of  R  (without  distribution)  is  not  diffi- 
cult. You  can  amend  the  proof  system  to  manipulate  consecutions  with  struc- 
ture on  the  right  as  well  as  on  the  left.  (I  won't  present  the  modification  of  the 
rules  here  because  they  are  the  same  as  the  rules  for  those  connectives  in  lin- 
ear logic  which  we  will  see  in  a  few  sections  time.)  The  system  will  not  prove 
the  distribution  of  conjunction  over  disjunction,  but  an  explicit  decision  pro- 
cedure for  the  whole  logic  can  be  found.  This  result  is  due  to  Meyer,  and  can 
be  first  found  in  his  dissertation  [174]  from  1966.  Meyer  also  showed  how 
LR  can  be  embedded  in  R[— >,  A]  by  translation.  Meyer's  translation  is  fairly 
straightforward,  but  I  will  not  go  through  the  details  here.56  I  will  sketch  a 
simpler  translation  which  comes  from  the  Vincent  Danos'  more  recent  work 
on  linear  logic  [64,  65] ,  and  which  is  a  simple  consequence  of  the  soundness 
and  completeness  of  phase  space  models.  We  translate  formulas  in  the  lan- 
guage of  LR  into  the  language  of  implication  and  negation  by  picking  a  par- 
ticular distinguished  proposition  in  the  target  language  and  designating  that 
as  f .  Then  we  define  ~  in  the  language  of  R[— >,  A]  by  setting  ~A  to  be  A  — >  f . 
Then  the  rest  of  the  translation  goes  as  follows: 


I  will  not  go  through  the  proof  of  the  adequacy  of  this  translation,  as  we  will 
see  it  when  we  come  to  look  at  phase  spaces.  However,  a  direct  demonstra- 
tion of  its  adequacy  (without  an  argument  taking  a  detour  through  models)  is 
possible.57  Given  this  translation,  any  decision  procedure  for  R[— >,  A]  trans- 
forms into  a  decision  procedure  for  the  whole  of  LR. 

McRobbie  and  Belnap  [168]  have  translated  the  implication  negation  frag- 
ment of  the  proof  theory  in  an  analytic  tableau  style,  and  Meyer  has  extended 
this  to  give  analytic  tableau  for  linear  logic  and  other  systems  in  the  vicinity  of 

55  Meyer  discovered  that  Kripke's  Lemma  is  equivalent  to  Dickson's  Theorem  about  primes: 
Given  any  set  of  natural  numbers  all  of  which  are  composed  out  of  the  first  m  primes  (that  is, 
every  member  has  the  form  p™ 1 p™ 2  ...p£k)ifno  member  of  this  set  has  a  proper  divisor  in  the 
set,  then  the  set  is  finite. 

56The  details  of  the  translation  can  be  found  elsewhere  [81,  94].  The  point  which  makes  the 
translation  a  little  more  complex  than  the  translation  I  use  here  is  the  treatment  of  f  and  its 
negation  t. 

57The  nicest  is  due  to  Danos.  Take  a  proof  of  X  h  Y  in  the  calculus  for  LR  and  translate  it  step 
by  step  into  a  proof  of  X1 ,  ~Yt  h  f .  (Here  ~Yt  is  the  collection  of  the  negations  of  the  translations 
of  each  of  the  elements  of  Y.)  The  translation  here  is  exacdy  what  you  need  to  make  the  rules 
correspond  (modulo  a  few  applications  of  Cut). 


(AAB)' 
(AVB)' 
(AoB)' 
(A->  B)1 


~(~A*  A~Bt) 
A*  ->  B1 


Greg  Restatl,  Greg .  Restaliamq .  edu .  au 


June  23,  2001 


http: //www. phil .mq.edu. au/staf f /grestall/ 


40 


R  [187].  Neither  time  nor  space  allows  me  to  consider  tableaux  for  substruc- 
tural  logics,  save  for  this  reference. 

Some  recent  work  of  Alasdair  Urquhart  has  shown  that  although  R[— >,  A] 
is  decidable,  it  has  great  complexity  [272,  275] :  Given  any  particular  formula, 
there  is  no  primitive  recursive  bound  on  either  the  time  or  the  space  taken  by 
a  computation  deciding  that  formula.  Urquhart  follows  some  work  in  linear 
logic  [156]  by  using  the  logic  to  encode  the  behaviour  of  a  branching  counter 
machines.  A  counter  machine  has  a  finite  number  of  registers  (say,  for  suit- 
able i)  which  each  hold  one  non-negative  integer,  and  some  finite  set  of  possi- 
ble states{say,  q;  for  suitable  j).  Machines  are  coded  with  a  list  of  instructions, 
which  enable  you  to  increment  or  decrement  registers,  and  test  for  registers' 
being  zero.  A  branching  counter  machine  dispenses  with  the  test  instructions 
and  allows  instead  for  machines  to  take  multiple  execution  paths,  by  way  of 
forking  instructions.  The  instruction  qi  +  Tjqk  means  "when  in  q^,  add  1  to 
register  r;  and  enter  stage  qk)"  and  qt  —  rj  qk  means  "when  in  qit  subtract  1  to 
register  r;  (if  it  is  non-empty)  and  enter  stage  qk,"  and  qif  qj  qk  is  "when  in  qit 
fork  into  two  paths,  one  taking  state  q  j  and  the  other  taking  qk." 

A  machine  configuration  is  a  state,  together  with  the  values  of  each  regis- 
ter. Urquhart  uses  the  logic  LR  to  simulate  the  behaviour  of  a  machine.  For 
each  register  n,  choose  a  distinct  variable  Rt,  for  each  state  qj  choose  a  dis- 
tinct variable  Qj.  The  configuration  (qr.ni , . . .  ,rii),  where  ru  is  the  value  of 
rt  is  the  formula 

QioR;'o...o  R™1 

and  the  instructions  are  modelled  by  sequents  in  the  Gentzen  system,  as  fol- 
lows: 

Instruction  Sequent 

qi  +  rjqk  Qt^Qk°Rj 

qi-Tjqk  Qi,Rji-Qk 

qifq,  qk       Qt  r-  Qj  V  Qk 

Given  a  machine  program  (a  set  of  instructions)  we  can  consider  what  is  prov- 
able from  the  sequents  which  code  up  those  instructions.  This  set  of  sequents 
we  can  call  the  theory  of  the  machine.  QioR^'  o  -  •  oR^1  h  QjoR™1  o-  •  -oRj™  is 
intended  to  mean  that  from  state  configuration  (q^ni , . . . , ni)  all  paths  will 
go  through  configuration  (qjjrrii , . . . ,  mi)  after  some  number  of  steps. 

A  branching  counter  machine  accepts  an  initial  configuration  if  when  run 
on  that  configuration,  all  branches  terminate  at  the  final  state  qf ,  with  all  reg- 
isters taking  the  value  zero.  The  corresponding  condition  in  LR  will  be  the 
provability  of 

Qi  o  R['  o  •  •  ■  o  Rj11  h  Qm 

This  will  nearly  do  to  simulate  branching  counter  machines,  except  for  the 
fact  that  in  LR  we  have  A  h  A  o  A.  This  means  that  each  of  our  registers 
can  be  incremented  as  much  as  you  like,  provided  that  they  are  non-zero  to 
start  with.  This  means  that  each  of  our  machines  need  to  be  equipped  with 
every  instruction  of  the  form  qt>0  +  r,-  qt,  meaning  "if  in  state  qt,  add  1  to  r,- , 
provided  that  it  is  already  nonzero,  and  remain  in  state  q^." 

Urquhart  is  able  to  prove  that  a  configuration  is  accepted  in  branching 
counter  machine,  if  and  only  if  the  corresponding  sequent  is  provable  from 
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the  theory  of  that  machine.  But  this  is  equivalent  to  a  formula 

Theory  (M)  At  — >  (Qi  ->  Qm) 

in  the  language  of  LR.  It  is  then  a  short  step  to  our  complexity  result,  given  the 
fact  that  there  is  no  primitive  recursive  bound  on  determining  acceptability 
for  these  machines.  Once  this  is  done,  the  translation  of  LR  into  R^  A  gives  us 
our  complexity  result. 

Despite  this  complexity  result,  Kripke's  algorithm  has  been  implemented 
with  quite  some  success.  The  theorem  prover  Kripke,  written  by  McRobbie, 
Thistlewaite  and  Meyer,  implements  Kripke's  decision  procedure,  together 
with  some  quite  intelligent  proof-search  pruning,  by  means  of  finite  mod- 
els. This  implementation  works  in  many  cases  [263].  Clearly,  work  must  be 
done  to  see  whether  the  horrific  complexity  of  this  problem  in  general  can  be 
transferred  to  results  about  average  case  complexity. 

2. 7  Richer  Structures  Gentzen  systems  for  distribution 

Grigori  Mine  [195,  197]  and  J.  Michael  Dunn  [78]  independently  developed 
a  Gentzen-style  consecution  calculus  for  relevant  logics  in  the  vicinity  of  R. 
As  we  have  seen  in  the  Gentzen  calculus  for  R[— >],  the  distinctive  behaviour 
of  implication  arises  out  of  the  presence  or  absence  of  structural  rules  gov- 
erning the  combination  of  premises.  To  find  a  logic  without  the  paradoxes  of 
implication,  we  are  lead  to  reject  the  structural  rule  of  weakening.  However, 
the  structural  rule  of  weakening  is  required  to  prove  distribution.58  Dunn 
and  Mine's  innovations  were  to  see  a  way  around  this  apparent  impasse.  One 
way  to  think  of  the  problem  is  this:  consider  the  proof  of  distribution  in  Fig- 
ure 2  on  page  36.  Focus  on  the  point  at  which  (VL)  is  applied.  The  proof 
moves  from  A,  B  h  •  •  •  and  A,  C  h  •  •  •  to  A,  B  V  C  h  ■  ■  -.  It  is  this  point 
at  which  some  form  of  distribution  has  just  been  used:  we  have  used  the 
disjunction  rule  inside  a  comma  context.  This  makes  disjunction  distribute 
over  whatever  the  comma  represents.  In  the  case  where  comma  is  the  met- 
alinguistic analogue  of  fusion  (as  it  is  in  these  proof  systems)  we  can  prove 
Ao(BVC)  h  (A  o  B)  V  (A  o  C).  We  cannot  prove  the  distribution  of  ex- 
tensional  conjunction  over  disjunction  simply  because  there  is  no  structure 
able  to  represent  conjunction  in  this  proof  system.59  The  solution  to  provide 
distribution  is  then  to  allow  a  structure  to  represent  extensional  conjunction, 
just  as  there  is  a  structural  analogue  for  intensional  conjunction. 

In  a  proof  system  like  this,  we  define  structures  recursively,  allowing  both 
intensional  and  extensional  conjunction. 

o  A  formula  is  a  structure. 

o  If  X  and  Y  are  structures,  so  is  (X;  Y).  This  is  the  intensional  combination 
of  X  and  Y. 

58At  least,  it  is  required  if  the  proof  is  going  to  be  anything  like  the  proof  of  distribution  in 
standard  Gentzen  systems. 

59That  is  a  simplification.  The  proof  could  be  dualised,  and  work  in  a  proof  system  with  single 
antecedent  and  multiple  consequents,  for  a  dual  intuitionistic  logic.  In  this  case  it  is  the  structure 
for  extensional  disjunction  which  would  distribute  over  the  conjunction  rule.  The  relevant  part 
of  a  proof  would  be  the  move  from  •  ■  ■  h  A,  B  and  •  ■  ■  h  A,  C  to  •  •  •  h  A,  B  A  C,  distributing  a 
conjunction  over  a  disjunction  again. 
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o  If  X  and  Y  are  structures,  so  is  (X,  Y).  This  is  the  extensional  combination 
of  X  and  Y. 

Then  all  of  the  traditional  structural  rules  (B,  C,  K,  W)  are  admitted  for  ex- 
tensional combination,  and  only  a  weaker  complement  (say,  omitting  K,  for 
relevant  logic,  or  all  but  associativity,  for  the  Lambek  calculus,  or  some  other 
menu  of  choices  for  some  other  substructural  logic)  are  admitted  for  inten- 
sional  combination. 

The  rules  for  the  connectives  may  remain  unchanged  (apart  from  the  no- 
tational  variation  ";"  for  intensional  combination,  instead  of  ","  which  was 
used  up  until  this  point).  However,  the  rules  for  conjunction  may  be  varied 
to  match  those  for  fusion:  we  can  instead  take  extensional  conjunction  to  be 
explicitly  paired  with  extensional  combination. 

X(A,B)hC  XhA  YhB 

[AL']    [AR'] 


X(AAB)hC  X,Y  h  A  AB 

These  rules  are  admissible,  given  the  original  structure-free  rules,  as  these 
demonstrations  show.60 

X(A,B)hC  YhB 

(AL)   (K) 


X(AAB,B)hC  XhA  Y,  X  h  B 

(AL)   (K)   (C) 


X(A  AB,A  AB)  h  C  X, Y  h  A  X,  Y  h  B 

  (Wl)   (AR) 

X(AAB)hC  X,  Y  h  A  A  B 

The  modified  proof  theory  is  sound  and  complete  for  the  relevant  logic  R  and 
its  neighbours.  The  cut  elimination  proof  works  as  before  —  even  with  richer 
structures,  the  conditions  of  the  cut  elimination  proof  (permutability  of  cut 
with  other  rules,  eliminability  of  matching  principal  formulas)  are  still  satis- 
fied. The  subformula  property  is  also  satisfied,  and  the  proof  theory 

However,  the  beneficial  consequences  of  a  cut-free  Gentzen  system  for  a 
logic  —  its  decidability  —  is  not  always  available.  The  difficulty  is  the  pres- 
ence of  contraction  for  extensional  combination.  This  is  not  surprising,  be- 
cause as  we  will  see  later,  R  is  undecidable.  You  cannot  extract  a  decision  pro- 
cedure from  its  Gentzen  calculus.  However,  in  the  absence  of  expansive  rules 
such  as  W  and  Wl,  a  decision  procedure  can  be  found,  as  Steve  Giambrone 
found  in  the  early  1980s  [114].  Giambrone's  decidability  argument  for  the 
negation-free  fragment  of  R  without  contraction  (which,  we  will  see,  is  equiv- 
alent to  linear  logic  with  distribution  added)  and  also  for  positive  TW.  Ross 
Brady  extended  this  argument  in  the  early  1990s  to  show  that  RW  and  TW 
are  decidable  [47] .  Brady's  technique  involved  extending  the  Gentzen  system 
with  signed  formulas,  to  give  straightforward  rules  for  negation  without  re- 
sorting to  a  multiple  consequent  calculus. 

Other  extensions  to  this  proof  theory  are  possible  for  different  applica- 
tions. Belnap,  Dunn  and  Gupta  extended  Dunn's  original  work  to  model  R 
with  an  S4-style  necessity  [34].  I  have  shown  how  a  system  like  this  one  can 
be  used  to  motivate  an  extension  of  the  Lambek  calculus  which  is  sound  and 


60The  converse  proofs,  to  the  effect  that  (AL)  and  (AR)  are  admissible  in  the  presence  of  (AL') 
and  (AR')  are  just  as  simple. 
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complete  for  its  intended  interpretation  on  conjunction  and  disjunction  on 
frame  models  [228]  (unlike  the  structure-free  rules  which  Lambek  originally 
proposed). 

The  natural  deduction  analogue  of  the  Gentzen  system  has  been  the  focus 
of  much  attention,  too.  Read  uses  the  natural  deduction  system  as  the  basis 
of  his  presentation  of  R  in  his  Relevant  Logic  [224].  Slaney,  in  an  influential 
article  from  1990  [256]  gives  a  philosophical  defence  of  the  two  different  sorts 
of  bunching  operators,  characterising  extensional  combination  of  bodies  of 
information  as  a  monotonic  lumping  of  information  together,  while  taking 
intensional  combination  of  X  with  Y  (that's  X;  Y)  as  the  application  of  X  to  Y. 
This  distinction  motivates  the  rules  for  implication  (X  h  A  — >  B  iff  X;  A  h  B: 
A  — >  B  follows  from  X  just  when  whenever  you  apply  X  to  A,  the  resulting 
information  gives  B). 

O'Hearn  and  Pym  call  this  kind  of  proof  theory  the  logic  of  bunched  im- 
plications [204] ,  and  they  use  it  to  model  computation. 

2.8   Display  Logic 

Nuel  Belnap's  Display  Logic  [30]  is  a  neat,  uniform  method  for  providing  a 
cut-free  consecution  calculus  for  a  wide  range  of  formal  systems.  The  cen- 
tral ideas  of  Belnap's  Display  Logic  are  simple  and  elegant.  Like  other  con- 
secution proof  theories,  the  calculus  deals  with  structured  collections  of  for- 
mulas, consecutions.  In  display  logic,  consecutions  are  of  the  form  X  h  Y, 
where  X  and  Y  are  structures,  made  up  from  formulas.  Structures  are  made 
up  of  structure-connectives  operating  on  structures,  building  up  structures 
from  smaller  structures,  in  much  the  same  way  as  formulas  are  built  up  by 
formula-connectives.  The  base  level  of  structures  are  the  formulas.  So  far, 
display  logic  is  of  a  piece  with  standard  Gentzen  systems  —  in  traditional 
systems  structures  are  simply  lists,  and  in  the  more  avant  garde  systems  of 
Dunn  and  Mine,  structures  can  be  made  up  of  two  bunching  operators  — 
but  in  Belnap's  work,  structures  can  be  even  richer.  This  richness  is  present 
so  that  consecutions  can  support  the  display  property:  any  substructure  of  a 
consecution  can  be  displayed  to  be  the  entire  antecedent  or  consequent  of 
an  equivalent  consecution.  In  general,  what  is  wanted  is  a  way  to  "unravel"  a 
context  like  so  that  we  can  perform  equivalences  such  as  this: 

X(Y)  h  Z  is  equivalent  to  Y  h  X"1  (Z) 

where  the  Y  inside  the  structure  X(Y)  is  exposed  to  view,  and  the  surrounding 
X( — )  context  is  unravelled.  Once  you  can  do  this,  connective  rules  are  sim- 
ple, because  you  can  assume  that  each  formula  is  displayed  to  be  the  entire 
antecedent  or  consequent  of  a  consecution. 

Belnap's  original  work  on  display  logic  was  motivated  by  the  problem  for 
finding  a  natural  proof  theory  for  relevant  logics  and  their  neighbours.  As 
a  result,  it  is  illustrative  to  see  the  choices  he  made  in  constructing  rules  to 
allow  the  display  of  substructures.  Here  are  some  equivalences  present  in  R 
and  weaker  relevant  logics. 

AhB  +  C         A  h  B 

AoBhC     

  Ao-BhC       ~B  I — A 

A  I — B  +  C  — =  = 
AhC+B       —A  h  B 
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These  equivalences  allow  us  to  "get  under"  the  connectives  in  formulas.  Here, 
the  equivalences  govern  fusion,  fission  and  negation.  In  traditional  Gentzen 
systems,  the  "comma"  is  an  overloaded  operator,  signifying  conjunction  in 
antecedent  position  and  disjunction  in  consequent  position.  That  is,  a  conse- 
cution of  the  form  X  h  Y  is  interpreted  as  saying  something  like:  "if  everything 
in  X  is  true,  something  in  Y  must  be  true."  In  substructural  logics,  this  comma 
(the  one  which  also  governs  the  behaviour  of  implication)  is  interpreted  as  fu- 
sion on  the  left,  and  if  it  appears  on  the  right  at  all,  as  fission.  Belnap  noted 
that  we  could  get  the  display  properly  if  you  add  a  structural  connective  for 
negation.  If  you  write  this  connective  with  an  asterisk,  you  get  the  following 
display  postulates  to  parallel  the  facts  we  have  already  seen,  governing  fusion 
and  fission. 

XhYoZ  XhY 

XoYhZ    — 

==  X  o  *Y  h  Z  *Y  h  *X 
Xh*YoZ  =   

XhZoY  **XhY 

(Belnap  uses  "o"  for  the  structure  connective  which  is  fusion-  and  fission-like, 
and  I  will  follow  him  in  this  notation.)  As  before,  structures  can  be  interpreted 
in  "antecedent"  position  or  in  "consequent"  position.  However,  now  we  can 
have  "o"  representing  fusion  on  the  right  of  the  turnstile,  or  fission  on  the  left, 
because  the  negation  operator  flips  structures  from  one  position  to  another. 
Consider  the  equivalence  of  X  o  Y  h  Z  with  X  h  *  Y  o  Z.  In  the  first  consecution, 
the  structure  Y  is  on  the  left  of  the  turnstile,  but  on  the  second  it  is  on  the  right. 
It  must  have  the  same  content  in  both  cases61  which  means  that  the  structure 
connectives  inside  Y  must  be  interpreted  in  the  same  way.  With  this  caveat, 
the  display  calculus  is  a  straightforward  Gentzen-system  with  structure  con- 
nectives allowing  both  positive  and  negative  information.  The  rules  govern- 
ing the  connectives  are  straightforward  analogs  of  the  traditional  rules,  with 
the  simplification  that  we  can  now  assume  that  principal  formulas  are  the 
entire  antecedent  or  consequent  of  the  consecutions  which  introduce  them. 
Here  are  the  conditional  rules: 

XoAhB         XhA    B  h  Y 

X  h  A  ^  B  A^Bh*XoY 

The  display  postulates  mean  that  the  cut  rule  appropriate  for  a  display  calcu- 
lus can  be  stated  exceedingly  simply: 

XhA    Ah Y 


There  is  no  need  for  a  stronger  rule  placing  the  cut-formula  in  a  context,  be- 
cause we  can  always  assume  that  the  cut  formula  has  been  displayed.  This 
is  an  advance  in  the  proof  theories  of  substructural  logics  because  some  the 
various  strengthenings  of  the  cut  rule,  required  to  prove  the  cut- elimination 
theorem,  are  not  valid  in  some  substructural  systems.62  In  his  original  paper, 

61  It  is  justified  by  the  equivalence  of  A  o  B  h  C  with  A  h  ~B  +  C,  and  in  this  case  B  "means  the 
same  thing"  in  both  cases. 

62The  most  generous  case  of  Mix  —  from  X  h  Y(A)  and  X'(A)  h  Y'  to  some  conclusion, 
where  both  Y(  A)  andX'(A)  involve  multiple  occurrences  of  A  to  ve  eliminated  —  seems  to  have 
no  appropriate  valid  conclusion  in  general  substructural  logics. 
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Belnap  provides  a  list  of  eight  easily  checked  conditions.  If  a  display  proof 
theory  satisfies  these  conditions,  then  Cut  is  admissible  in  the  system.  We 
need  not  go  in  to  the  detail  of  these  conditions  here.63 

Belnap  shows  that  different  logical  systems  can  be  given  by  adding  dif- 
ferent structural  rules  governing  the  display  connectives  —  and  that  further- 
more, the  one  proof  system  can  have  more  than  one  family  of  display  con- 
nectives. This  is  the  parallel  with  the  Dunn-Mine  Gentzen  system  for  logics 
with  distribution.  Belnap  shows  how  you  can  construct  proof  theories  for 
relevant  logics,  modal  logics,  intuitionistic  logic,  and  logics  which  combine 
connectives  from  different  families. 

The  idea  of  using  display  postulates  to  provide  proof  theories  for  different 
connectives  is  not  restricted  to  Belnap's  original  family  featuring  a  binary  op- 
erator o  and  a  unary  *.  Wansing  [285]  extended  Belnap's  original  work  show- 
ing that  a  unary  structure  •  with  display  rules 

•X  h  Y 
Xh»Y 

would  suffice  to  model  normal  modal  logics.  The  corresponding  connective 
rules  for  □  are 

X  h  «A  A  h  X 

 PR]   PL] 

X  h  DA  DA  h  »X 

This  shows  that  □  is  the  object-language  correlate  of  •  in  consequent  posi- 
tion.64 As  a  result,  display  logic  has  been  used  outside  its  original  substruc- 
tural  setting.  Wansing  has  shown  that  display  logic  is  a  natural  home  for 
proof  theory  for  classical  modal  logics  [285,  286] ,  Belnap  has  extended  his 
calculus  to  model  Girard's  linear  logic  [31], 65  Gore  and  I  have  used  the  dis- 
play calculus  to  model  substructural  logics  other  than  those  considered  by 
Belnap  [124,  229],  and  I  have  extracted  some  decidability  results  in  the  vein 
of  Giambrone  and  Brady  [232]. 66 

2.9   Linear  Logic 

Girard,  in  1987,  introduced  linear  logic,  a  particular  substructural  system  that 
allows  commuting  and  reassociating  of  premises,  but  no  contraction  or  weak- 
ening [117].  Perhaps  Girard's  major  innovation  in  linear  logic  is  the  introduc- 
tion of  the  modalities  —  the  exponentials67  which  allow  the  recovery  of  these 

63 1  have  generalised  Belnap's  conditions  for  the  admissibility  of  cut  in  such  a  way  as  to  include 
traditional  consecution  systems  as  well  as  display  logics.  It  remains  unclear  if  this  generalisation 
will  prove  useful  in  practice,  but  it  does  seem  to  be  an  advantage  to  not  have  to  prove  the  cut 
elimination  theorem  again  and  again  for  each  proof  system  you  construct  [234,  Chapter  6] . 

64 Its  partner  in  antecedent  position  is  a  possibility  operator,  but  the  dual  possibility  operator 
which  looks  backwards  down  the  accessibility  relation  for  necessity.  It  is  tied  together  to  □  by  the 
display  postulates  A  h  DB  if  and  only  if  4A  h  B. 

65The  issue  is  the  treatment  of  the  exponentials. 

66However,  Kracht  has  shown  that  in  general,  decidability  results  from  a  display  calculus  are 
not  to  be  expected.  He  has  shown  that  in  general,  it  is  undecidable  whether  a  given  displayed 
modal  logic  is  decidable  [145]. 

67 So  called  because  of  the  equivalence  between  !(A  A  B)  and  !A  o  !B,  and  dually,  between 
?(A  V  B)  and  ?A  +  ?B.  This  also  explains  why  A  and  V  are  the  additives  and  o  and  +  are  the 
multiplicatives  in  the  parlance  of  linear  logic. 
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structural  rules  in  a  limited,  controlled  fashion.  Linear  logic  has  a  straight- 
forward resource  interpretation:  when  premises  and  conclusions  are  taken  to 
be  resources  to  be  used  in  proof,  then  the  absence  of  contraction  indicates 
that  resources  cannot  be  duplicated,  and  the  absence  of  weakening  indicates 
that  resources  cannot  be  simply  thrown  away.  Only  particular  kinds  of  re- 
sources —  those  marked  off  by  the  exponentials  —  can  be  treated  in  this 
manner.  Linear  logic  has  received  a  great  deal  of  attention  in  the  literature 
in  theoretical  computer  science. 


2. 9.1    Gen  tzen  Systems 

The  most  straightforward  proof  theory  for  linear  logic  is  a  consecution  system 
where  consecutions  feature  structure  in  the  antecedent  and  the  consequent: 

Xh  Y,A  X'.AhY' 

a  h  a   :  :  [cm] 

X'.Xh  Y,Y' 

XhA.Y  X,  A  h  Y 

[~L]   [~R] 


X,~AhY  Xh~A,Y 

X,  A  h  Y  X,  B  h  Y  X  h  Y,  A    X  h  Y,  B 

  [AL,  ]    [AL2]   

X,  A  A  B  h  Y  X,  A  A  B  h  Y  X  h  Y,  A  A  B 

X,  A, B  h  Y  X  h  Y,  A  X'hB.Y' 


oL]   [oR] 

X,  A  o  B  h  Y  X,X'hY,AoB,Y' 

X,  A  h  Y    X,  B  h  Y  X  h  Y,  A  X  h  Y,  B 

 [VL]   [VR,]   [VR2] 

X,  A  V  B  h  Y  XhY.AVB  X  h  Y,  A  V  B 

X,  A  h  Y    B.X'hY'  X  h  Y,  A,  B 

  [+L]    [+R] 

X,A  +  B,X'hY,Y'  X  h  Y,  A  +  B 

X  h  Y,  A    X'.BhY'  X,  A  h  B,  Y 

— ;   [->L]    [->R] 

X',X,A^BhY,Y'  X  h  A  — >  B, Y 

Xh  Y 

  [tL]         I-  t  [tR] 

X.th  Y 

Xh  Y 

fh    [fL]    [fR] 

Xh- Y,f 

X(l)hY  XhY(T) 

XhY  X,Ah  Y         !XhA,?Y         X,  !A,  !Ah  Y 

  [K!]    [L!]    [R!]    [WI!] 

X,!AhY        X,!AhY        !Xh!A,?Y  X,!Ah  Y 

XhY  Xh  Y,A  !X,Ah?Y  XhY,?A,?A 

  [K?]    [R?]    [L?]    [WI?] 

Xh?A,Y        XhY,?A         !X,?Ah?Y  XhY,?A 

Girard's  notation  for  the  connectives  differs  from  the  one  we  have  chosen 
here.  Figure  4  contains  a  translation  manual  between  the  three  traditions  we 
have  seen  so  far. 

Linear  logic  has  two  distinctive  features.  First,  the  exponentials,  which  al- 
low the  recovery  of  structural  rules.  Girard  in  fact  discovered  linear  logic  as 
a  decomposition  of  the  intuitionistic  conditional  A  d  B  into  !A  — >  B  in  the 
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ijOnnective 

Here 

Laniueiv 

Liirara 

implication 

A  — }  D 

A    1  R 
A  /  D 

A   ^  R 

A.  — O  D 

Converse  Implication 

R  i  A 
D  < —  A 

R  \  A 
D  \  A 

Pncinn 

J.  UOlvll 

AoB 

A  •  B 

A  ®  B 

Fission 

A  +  B 

A^  B 

Conjunction 

A  A  B 

A&B 

Disjunction 

A  V  B 

a  e  b 

Negation 

-A 

Church  Truth 

t 

1 

Church  Falsehood 

f 

± 

Ackermann  Truth 

T 

T 

Ackermann  Falsehood 

_L 

0 

Of  course 

I 

! 

Why  not 

? 

? 

Figure  4:  Translation  between  notations 


models  of  coherence  spaces,  which  we  shall  see  in  the  next  part  of  this  essay. 
For  now,  it  is  enough  to  get  a  taste  of  this  decomposition.  The  linear  implica- 
tion A  — >  B  indicates  that  one  use  of  A  is  sufficient  to  get  one  instance  of  B. 
The  exponential  is  the  operator  which  licences  arbitrary  re-use  of  resources. 
So,  an  intuitionistic  conditional  says  that  the  consequent  B  can  be  found,  us- 
ing as  many  instances  of  A  as  we  need.  Here  are  the  proofs  of  the  equivalence 
between  !(A  A  B)  and!Ao  !B. 

Ah A  BhB  Ah A  BhB 

(LA)   (LA)   (L!)   (L!) 


AABhA  AABhA  !Ah  A  !BhB 

(L!)   (L!)   (K!)   (K!) 


!(A  AB)  h  A  !(AAB)hB  !A,!BhA  !A,!BhB 

 (R!)   (R!)   (R!) 

!(AAB)  h  !A  !(AAB)h!B  !A,  !B  h  AAB 

  (oR)    (oR) 

!(AAB),!(AAB)h!Ao!B  !A,  !B  h  !(AAB) 

  (Wl!)   (Lo) 

!(A  AB)h!Ao!B  !Ao!Bh!(AAB) 

Vincent  Danos  has  shown  that  this  modelling  of  intuitionistic  logic  can  be 
made  very  intimate  [64,  65].  It  is  possible  to  translate  intuitionistic  logic  into 
linear  logic  in  such  a  way  that  all  intuitionistic  Gentzen  proofs  have  step-by- 
step  equivalent  linear  logic  proofs  of  their  translations. 

Another  distinctive  feature  of  linear  logic  is  the  pervasive  presence  of  du- 
ality in  the  system.  The  presence  of  negation  means  that  other  connectives 
can  be  easily  defined  in  terms  of  their  duals.  On  the  other  hand,  it  is  also  pos- 
sible to  take  negation  as  the  defined  connective  in  the  following  way:  for  each 
atomic  formula  p  pick  out  a  distinguished  atomic  formula  to  be  ~p.  Then  de- 
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fine  -A  for  complex  formulas  as  follows: 


~~A 

is 

A 

~(A  A  B) 

is 

~AV~B 

~(AVB) 

is 

-AA^B 

~T 

is 

_L 

~_L 

is 

T 

-(AoB) 

is 

-A  +  ^B 

~(A  +  B) 

is 

~Ao~B 

~t 

is 

f 

~f 

is 

t 

~!A 

is 

?~A 

~?A 

is 

!^A 

We  also  take  A  — >  B  to  be  defined  as  -A  +  B  (or  if  you  like,  ~(A  o  ~B),  which  is 
literally  the  same  formula  under  this  new  regime).  Together  with  this  aspect 
of  duality,  we  can  also  transpose  consecutions  from  the  multiple  left-right 
variety,  to  a  conclusion  only  system.  We  replace  the  consecution  X  h  Y  with 
the  consecution  I — X,  Y,  where  ~X  is  the  structure  containing  the  negations 
of  all  of  the  formulas  in  X.  Then  formulas  are  introduced  only  in  the  right,  and 
we  get  a  much  simpler  system,  with  one  rule  for  every  connective,  as  opposed 
to  two. 

hX,A  I — A,  Y 

I-  A,  -  A   [Cut] 

hX,Y 

hX,A     hX,B  hX,A  hX,B 

  [A]   [V,]   [V2] 

hX.AAB  hX.AVB  hX.AVB 

hX,A     h  B, Y  hX,A,B 


[+] 


hX,AoB,Y  hX,A  +  B 

ht[t]       -^-[f]  hX,T[T] 
h  X,f 

h?X,A        hX,A  hX  hX,?A,?A 

  [!]    [?]    [K?]    [WI?] 

h?X,!A       hX,?A       hX,?A  hX,?A 

2.9.2  ProofNets 

[[This  section  must  be  added.  Relevant  citations  will  be  from  among  [26,  43, 
57,66,107,117,119]]] 


2.10  Curry-Howard 

Some  logicians  have  found  that  it  is  possible  to  analyse  proofs  more  closely 
by  giving  them  names.  After  all,  if  proofs  are  first-class  entities,  we  will  be 
better-off  if  we  can  distinguish  different  proofs.  I  can  illustrate  this  by  looking 
at  an  example  from  intuitionistic  logic.  The  language  for  describing  proofs 
in  the  intuitionistic  logic  of  the  conditional  and  implication  is  given  by  the  A- 
calculus  with  pairing.  A  term  of  this  calculus  is  built  up  from  variables  x,  y , . . . 
using  the  constructors  (—,  —),  fst  (— ),  snd  (— ),  Ax.M  and  application  (which  we 
write  as  juxtaposition).  A  judgement  is  a  pair  M:A  of  a  term  M  and  a  formula 
A.  Then  in  proofs  in  this  system  we  keep  tabs  on  what  is  going  on  by  building 
terms  up  to  represent  the  ongoing  proof.  We  start  with  the  identity  rule  x:A  h 
x:A.  Then  for  conjunction,  we  reason  as  follows: 

ThM:A  ThN:B  ThM:AAB  ThM:AAB 
P  h  (M,  N):A  A  B        r  h  fst  (M):A  rhsnd(M):B 
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If  M  is  the  proof  of  A  from  F,  and  N  is  the  proof  of  B  from  V,  then  the  pair 
(M,  N)  is  the  proof  of  A  A  B  from  T.  Similarly,  if  M  is  a  proof  of  A  A  B  (from  T) 
then  fst  (M)  (the  "first  part"  of  M)  is  the  proof  of  A  from  F.  Similarly,  snd  (M) 
is  the  proof  of  B  from  f\  For  implication,  we  have  these  rules: 

r  h  M:A  ->  B    A  h  N:A  T,  x:A  h  M:B 

r,  A  h  (MN):B  r  h  Ax.M:A  ->  B 

If  M  is  a  proof  of  A  — >  B,  and  N  is  a  proof  of  A,  then  you  get  a  proof  of  B  by 
applying  MtoN.  So,  this  proof  is  (MN).  Similarly,  if  M  is  a  proof  of  B  from 
P,  x:A,  then  a  proof  of  A  — >  B  is  a  function  from  proofs  of  A  to  the  proof  of  B.  It 
is  of  type  Ax.M.  We  put  these  together  to  get  names  for  more  complex  proofs 

x:A  ->  B  h  x:A  ->  B    y:Ah  y:A 
x:A  — >  B,y:A  h  (xy):B 
y:A  h  Ax.(xy):(A  ->  B)  ->  B 
0  h  Ay.Ax.(xy):A  ->  ((A  ->  B)  ->  B) 

The  term  Ay.Ax.(xy)  encodes  the  shape  of  the  proof.  The  first  step  was  an 
application  of  one  assumption  on  another  (the  term  (xy)).  The  second  was 
the  abstraction  of  the  first  assumption  (Ax),  and  the  last  step  was  the  abstrac- 
tion of  the  second  assumption  (Ay).  The  term  encodes  the  proof.  There  are  a 
number  of  important  features  of  these  terms. 

o  Terms  encoding  proofs  with  no  premises  are  closed.  They  have  no  free 
variables. 

o  More  generally,  if  V  h  M:A  is  provable  and  x  is  free  in  M  then  x  appears 
free  in  T  too. 

o  Proofs  encode  connective  steps,  not  structural  rules.  For  example,  the 
rule  CI  or  C  was  used  in  the  proof  of  A  — >  ((A  — >  B)  — >  B.  It  is  not  encoded 
in  the  term  explicitly.  Its  presence  can  be  seen  implicitly  by  noting  that 
the  variables  x  and  y  are  bound  in  the  opposite  order  to  their  appearance. 

Once  we  have  a  term  system,  we  have  contracting  rules,  which  give  us  the 
behaviour  of  proof  reduction. 

fst(M,N)    ~»  M 
snd(M,N)    ~»  N 
(Ax.M)N    ~»  M.[x:=N] 

These  correspond  to  cutting  the  detours  out  of  proofs.  For  example,  consider 
the  reduction 

ThM:A  ThN:B 
P  h  (M,  N):A  A  B  V  h  M:A 

rhfst(M,N):A 
Or  a  slightly  more  complex  case: 

T,x:A  h  M:B 

r  h  Ax.M:A  — >  B     A  h  N:A  r,  A  h  M[x  :=  N]:B 

r,Ah  (Ax.M)N:B 
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The  term  M[x  :=  N]  indicates  that  the  assumption(s)  marked  x  in  M  are  re- 
placed by  N.  This  matches  the  assumption(s)  A  marked  x  in  V,  x:A  which  are 
replaced  by  the  A  in  the  transformation. 

An  explanation  of  the  Curry-Howard  isomorphism  between  intuitionistic 
logic  and  the  types  of  terms  in  the  A-calculus  is  found  in  Howard's  original  pa- 
per [137].  As  we've  already  heard,  Church's  original  calculus,  the  Al-calculus, 
was  actually  a  model  for  the  implicational  fragment  of  R  and  not  intuitionistic 
logic,  as  Church's  calculus  did  not  allow  the  binding  of  variables  which  were 
not  free  in  the  term  in  question  [54] .  You  eliminate  contraction  if  you  do  not 
allow  a  A  term  to  bind  more  than  one  instance  of  a  variable  at  once.  Similarly, 
you  eliminate  C  if  you  allow  variables  to  be  bound  only  in  the  order  in  which 
they  are  introduced.  Structural  rules  correspond  to  restrictions  on  binding. 
A  helpful  account  of  more  recent  general  work  in  types  and  logic  is  found  in 
Girard,  Lafont  and  Taylor's  Proofs  and  Types  [121],  and  Girard's  monograph 
Proof  Theory  and  Logical  Complexity  [118]. 

Work  on  the  application  of  the  term  calculus  to  substructural  logics,  which 
focuses  on  three  aspects.  First,  on  encoding  the  normalisation  results  (that 
cutting  detours  out  of  proofs  ends,  and  ends  in  a  canonical  "normal"  proof) . 
Second,  on  the  appropriate  term  encoding  of  the  exponentials  of  linear  logic. 
Work  in  this  area  has  not  yet  reached  stability.  The  work  of  Benton,  Bier- 
man,  Hyland  and  de  Paiva  [38,  39,  40]  shows  the  difficulty  present  in  the  area. 
Thirdly,  on  showing  that  the  restrictions  on  A-abstraction  in  substructural 
logics  has  useful  parallels  in  computation  where  resources  may  be  consumed 
by  computation.  Wadler  and  colleagues  show  that  this  kind  of  term  system 
has  connections  with  functional  programming  [160,  278,  279,  281,  280,  282, 
283]. 

2.11    Structurally  Free  Logic 

A  very  recent  innovation  in  the  proof  theory  of  substructural  logics  is  the  ad- 
vent of  structurally  free  logic.  The  idea  is  not  new  —  it  comes  from  a  1976 
essay  by  Bob  Meyer  [171].  However,  the  detailed  exposition  is  new,  dating 
from  1997  [42,  41,  93].  The  motivating  idea  is  simple.  Just  as  free  logicis  free 
from  existential  commitments  and  any  existence  claims  can  be  explicitly  ex- 
amined and  questioned,  so  in  a  structurally  free  logic,  no  structural  rules  are 
present  in  and  of  themselves,  but  structural  rules,  if  applied,  are  marked  in  a 
proof  as  explicit  premises.  So,  structural  rules  are  tagged  with  a  combinator, 
such  as  these  examples: 

W(X,(Y,Z))hA  W((X,Z),Y))hA     £  W((X,Y),Y))hA 

W(((B,X),Y),Z)hAIB1       W(((C,X),Y),Z)hA[C]       W((W,X),Y)hA  ™ 

These  are  the  combinator  versions  of  the  structural  rules  B  (association)  C 
(commutativity)  and  W  (contraction) .  Now  the  conclusions  not  only  feature 
the  structures  as  rearranged:  they  also  feature  a  combinator  marking  the  ac- 
tion of  the  structural  rule.  Proofs  in  this  kind  of  system  then  come  with  "tick- 
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ets"  indicating  which  kinds  of  structural  rules  licence  the  conclusion: 

BhB    Ah A 


(A^B),AhB  ChC 
(AHB),(C-)A,C)hB 

 (B 

((B,A^B),C^A),ChB 

(B,A^B),C^AhC^B 
B,  A  — >  B  h  (C  — >  A)  — >  (C  — >  B) 


B  h  (A  — >  B)  — >  ((C  — >  A)  — >  (C  — >  B)) 


R) 

:-»r) 

R) 


Any  further  explanation  the  workings  of  this  proof  system  for  structurally  free 
logic  brings  us  perilously  close  to  looking  at  models  for  combinatory  logic 
and  the  A-calculus  [16,  172].  I  will  defer  this  discussion  to  the  next  section, 
where  we  broach  the  question  in  a  broader  setting.  What  on  earth  counts  as 
a  model  of  a  substructural  logic? 


3  Models 

Our  focus  so  far  has  been  syntax  and  proof.  Now  we  turn  our  gaze  to  inter- 
pretation. Clearly  we  have  not  been  unconcerned  with  matters  of  interpreta- 
tion thus  far.  We  have  paid  some  attention  to  the  meanings  of  the  connec- 
tives when  we  have  examined  the  kinds  of  inferential  steps  appropriate  for 
sentences  formed  out  of  these  connectives.  According  to  some  views,  in  giv- 
ing these  rules  for  a  connective  we  have  thereby  explicated  their  meanings. 
According  to  other  views,  we  have  merely  cashed  out  a  consequence  of  the 
meanings  of  the  connectives,  meanings  which  are  to  be  found  in  some  other 
way.68  Thankfully,  we  have  no  need  to  adjudicate  such  a  debate  here.  It  is  not 
our  place  to  clarify  the  ultimate  source  of  meaning.  It  is,  however,  our  place 
to  consider  some  of  the  different  kinds  of  interpretations  open  to  logical  sys- 
tems, and  particular,  substructural  logics. 

An  interpretation  of  a  language  maps  the  sentences  in  the  language  onto 
some  kind  of  structure.  There  are  many  possible  kinds  of  interpretations. 
Some  propositions  are  true  and  others  are  not  true.  We  can  interpret  a  lan- 
guage in  the  structure  {t,  f}  of  truth  values  by  setting  the  interpretation  [A] 
of  A  to  be  t  if  A  is  true,  and  f  otherwise.  This  interpretation  is  helpful  in  the 
study  of  logical  consequence  because  of  the  way  it  interacts  with  the  tradi- 
tional propositional  connectives.  A  conjunction  is  true  if  and  only  if  both  of 
the  conjuncts  are  true.  A  disjunction  is  true  if  and  only  if  one  of  the  disjuncts 
is  true.  A  negation  is  true  if  and  only  if  the  negand  is  not  true.  It  follows  that 
[A  A  B],  [A  V  B]  are  functions  of  [A]  and  [B],  in  the  sense  that  once  the  val- 
ues [A]  and  [B]  are  fixed,  the  values  [A  A  B],  [A  V  B]  are  also  fixed.  The  be- 
haviour of  the  operations  of  conjunction,  disjunction  and  negation  on  the  set 
{t,  f]  of  truth  values  goes  some  way  towards  telling  us  the  meanings  of  those 
connectives.  More  than  that,  it  gives  us  an  account  of  the  behaviour  of  logical 

68This  debate  is  between  truth  conditional  [260]  versus  inferentialist  [50]  accounts  of  meaning 
in  philosophy  of  language,  proof  theorists  [118,  73]  and  model  theorists  [25, 136]  in  mathematical 
logic,  and  operational  and  denotational  semantics  in  computer  science  [198] . 
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consequence,  as  the  set  of  truth  values  has  a  natural  order.  We  can  order  the 
set  by  saying  that  f  <  t,  in  the  sense  that  t  is  "more  true"  than  f.  An  argument 
is  {t,  f]-valid  if  no  matter  how  you  interpret  the  propositions  in  the  argument, 
the  conclusion  is  never  any  less  true  than  the  premises.  Or  in  this  case,  you 
never  can  interpret  the  premises  as  true  and  the  conclusion  as  false.  This  is 
the  traditional  truth-table  conception  of  validity. 

The  simple  set  {t,  f}  of  truth  values  is  not  the  only  domain  in  which  a  lan- 
guage can  be  interpreted.  For  example,  we  might  think  that  not  all  proposi- 
tions or  sentences  in  the  language  are  truth-valued.  We  might  interpret  the 
language  in  the  structure  {t,  n,  f},  where  the  true  claims  are  interpreted  as  t, 
the  false  ones  as  f,  and  the  non-truth-valued  sentences  are  interpreted  as  n. 
This  path  leads  one  to  many  valued  logics  [91,  271]. 

However,  one  need  not  interpret  the  domain  of  values  as  truth  values.  For 
one  early  example  of  an  alternative  sort  of  domain  in  which  sentences  can  be 
interpreted,  consider  Frege's  later  philosophy  of  language.  For  the  Frege  of 
the  Grundgesetze  [102]  declarative  sentences  had  a  reference  [Bedeutung)  and 
a  sense  [Sinn).  We  can  interpret  sentences  by  mapping  them  onto  a  domain 
of  senses  and  by  interpreting  the  connectives  as  functions  on  senses.  This  is 
another  "denotational"  semantics  for  declarative  sentences.69 

Different  applications  will  motivate  different  sorts  of  models  and  domains 
of  semantic  values.  In  the  Lambek  calculus  for  syntactic  types,  the  formulas 
can  be  mapped  onto  sets  of  syntactic  strings.  In  this  interpretation,  a  sen- 
tence will  be  modelled  by  the  set  of  strings  (in  the  analysed  language)  which 
have  the  type  denoted  by  the  sentence. 

These  last  two  examples  —  of  possible  worlds  and  of  syntactic  strings  — 
have  similar  structures.  Formulas  are  interpreted  as  sets  of  objects  of  one  kind 
or  other.  These  are  especially  interesting  models,  which  we  will  discuss  in 
detail  soon.  For  now,  however,  I  will  focus  on  the  general  idea  of  interpreting 
logics  in  structures,  for  simple  algebras  are  the  first  port  of  call  when  it  comes 
to  models  of  substructural  logics. 

3.1  Algebras 

The  most  direct  way  to  interpret  a  logic  is  by  a  map  from  the  language  of  the 
logic  into  some  structure.  Such  structures  are  usually  equipped  with  opera- 
tions to  match  the  connectives  in  the  language.  The  interpretation  of  a  com- 
plex formula  is  then  defined  recursively  in  terms  of  the  operations  on  the  in- 
terpretations of  the  atomic  subformulas.  All  of  this  is  standard.  In  this  sec- 
tion, I  will  examine  just  a  few  structures  which  have  proved  to  be  useful  in 
the  study  of  substructural  logics.  Then  in  the  next  section,  I  will  explain  just  a 
few  of  the  theorems  which  can  be  proved  about  substructural  logics  by  using 
these  structures. 

69For  a  modern  interpretation  of  Frege's  ideas,  one  could  consider  a  sense  of  a  claim  to  be 
the  set  of  possible  worlds  in  which  it  is  true.  Now  for  each  sentence  you  have  its  interpretation 
as  some  set  of  possible  worlds.  For  an  account  of  how  this  approach  might  be  philosophically 
productive,  see  Robert  Stalnaker's  Inquiry  [258],  David  Lewis'  On  the  Plurality  of  Worlds  [154]. 

For  recent  work  which  takes  Frege's  talk  of  senses  at  face  value  (and  which  motivates  a  weak 
substructural  logic,  to  boot)  consider  the  paper  "Sense,  Entailment  and  Modus  Ponens"  by  Gra- 
ham Priest  [215]. 


Greg  Restall,  Greg .  Restaliamq .  edu .  au 


June  23,  2001 


http: //www. phil .mq.edu. au/staf f /grestall/ 


53 


3.1.1    Example  Algebras 

Example  17  (BN4)  Perhaps  the  most  simple,  yet  rich,  finite  structure  used 
to  interpret  substructural  logics  is  the  four-valued  lattice  BN4  [79,  28,  29].  ft 
first  came  to  fame  as  a  simple  lattice  sufficient  to  interpret  first  degree  entail- 
ments. Any  valid  first  degree  entailment  is  valid  in  this  structure  (in  a  sense 
to  be  explained  soon)  and  any  invalid  first  degree  entailment  is  invalid  in  this 
structure.  It  is  also  the  source  of  intuitions  in  its  own  right.  The  behaviour 
of  BN4  is  presented  in  the  diagram  and  tables  in  Figure  5.  The  diagram  can 
present  the  behaviour  of  conjunction,  disjunction,  T  and  _L.  The  conjunction 
of  two  elements  is  their  greatest  lower  bound,  their  disjunction,  the  least  up- 
per bound,  T  is  the  top  element  and  _L  is  the  bottom  element.  The  operations 
of  negation  and  implication  and  fusion  are  read  off  the  tables. 


If  you  think  of  the  values  t,  b,  n,  f  as  the  values  "true  only",  "both  true  and 
false",  "neither  true  nor  false"  and  "false  only"  then  the  negation  of  a  set  values 
is  simply  the  set  of  the  negations  of  values  in  that  set.  Implication  is  similarly 
defined.  The  value  "true"  is  in  the  set  a  — >  b  just  when  if  a  is  at  least  "true" 
then  b  is  at  least  "true",  and  if  b  is  at  least  "false"  then  so  is  a.  On  the  other 
hand,  a  conditional  a  — >  b  is  at  least  "false"  if  a  is  at  least  "true"  and  b  is  at 
least  "false."  This  gives  the  implication  table.  The  values  for  fusion  table  are 
given  by  setting  a  o  b  to  be  ~(a  — >  ~b). 

Given  this  definition,  fusion  is  commutative  and  associative,  with  an  iden- 
tity b.  Negation  is  definable  in  terms  of  implication  by  setting  ~a  to  be  a  — >  b. 
So  in  this  algebra,  the  false  constant  f  is  modelled  by  b,  as  is  the  true  constant 
t.  Fusion  is  residuated  by  — >,  and  the  lattice  is  distributive. 

In  this  algebra,  the  order  in  the  diagram  (read  from  bottom  to  top,  and 
written  "<")  models  entailment.  You  can  see  that  a  A  b  always  entails  a,  as 
the  greatest  lower  bound  of  a  and  b  (whatever  a  and  b  might  be)  is  always 
lower  than,  or  equal  to,  a.  In  just  the  same  way,  you  can  show  that  all  of  the 
entailments  of  a  distributive  lattice  hold  for  A  and  V,  that  a  =  — a  (and  so, 
double  negation  elimination  and  introduction  hold)  and  that  the  de  Morgan 
laws,  such  as  -(a  V  b)  =  ~a  A  ~b  also  hold  in  this  structure. 

In  this  lattice,  some  structural  rules  fail:  Wl  is  not  satisfied,  as  n  j£  n  o  n. 
The  K  rule  also  fails,  as  t  o  b  <  b,  and  hence  we  do  not  have  b  <  t  — >  b. 
However,  fusion  is  symmetric  and  associative.  So  BN4  is  a  model  for  linear 
logic  (with  the  addition  of  distribution),  in  the  sense  that  if  A  h  B  holds  in 
linear  logic  plus  distribution,  then  for  any  interpretation  [■]  into  BN4,  we  must 
have  [A]  <  [B]. 

However,  BN4  is  not  a  model  of  R,  for  some  contraction  related  principles 
fail  in  BN4.  For  example,  there  is  an  interpretation  in  which  [A  A  ( A  — >  B)]  ^ 
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Figure  6:  An  Eight-Point  Lattice  for  R 


[B].70 

This  lattice  has  also  been  used  in  the  semantics  of  programming  [101]. 
Interpreting  the  four  values  as  epistemic  states  of  no  information,  positive 
and  negative  information,  and  conflicting  information,  may  be  of  some  help 
in  modelling  states  of  information-bearing  devices. 


Example  18  (An  Eight  Point  Model)  Consider  the  structure  with  the  or- 
der and  fusion  table  shown  in  Figure  6.  This  is  a  model  of  R:  Fusion  is  com- 
mutative (the  table  is  symmetric  about  the  diagonal),  and  associative.  We 
have  x  <  x  o  x,  so  Wl  holds.  The  element  a  is  an  identity  for  fusion.  Negation 
is  defined  by  the  names  of  the  elements  and  the  fact  that  ~  is  a  de  Morgan 
negation.  Setting x  — >  y  =  ~(x  o  ~y)  makes  — >  residuate  fusion. 

We  can  use  this  structure  to  show  that  R  has  the  relevance  property.  Sup- 
pose we  have  two  propositions  A  and  B,  in  the  language  A,  V,  ~,  o  and  — >, 
such  that  there  is  no  atom  shared  between  A  and  B.  Construct  an  evaluation 
[o],  such  that  [pi  is  either  b  or  ~b  for  any  atom  p  in  A,  and  it  is  either  c  or  ~c 
for  any  atom  p  in  B . 

By  induction,  we  can  verify  that  the  value  [A]  is  one  of  b  and  ~b,  and  sim- 
ilarly, the  value  [B]  is  one  of  c  or  ~c.  Therefore,  [A]  ^  [B],  and  since  this  is  a 
model  of  R,  we  have  A  1/  B  in  R,  and  hence,  A  1/  B  in  any  sublogic  of  R. 

Example  19  (Sugihara  Models)  One  can  modify  BN4  in  a  number  of  ways. 
You  can  leave  out  the  value  b,  and  get  Lukasiewicz's  three-valued  logic.  Ex- 
tensions to  Lukasiewicz's  n-valued,  and  infinitary  logics  are  straightforward 
too.  These  systems  all  invalidate  contraction,  but  validate  weakening  and  the 
other  common  contraction-free  structural  rules.  Another  way  of  modifying 
BN4  is  to  leave  out  the  value  n.This  gives  us  the  structure  known  as  RM3,  a 
three-valued  algebra  useful  in  the  study  of  relevant  logics,  because  this  is  a 
model  of  R.  This  simple  three-valued  model  can  be  generalised  to  RMm+i 
for  any  n  as  follows  by  setting  the  domain  of  propositions  to  be  the  numbers 

{-n,  -(u  -  1 ), . . . ,  -1 , 0, 1 , . . . , n -  1 , n} 
70Hint:  set  [A]  =  n  and  [B]  =  f.  Check  for  yourself  that  this  is  a  counterexample. 
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where  we  set  ~q  to  be  —a,  and  — >  and  fusion  are  denned  as  follows: 


ci  — >  b  = 


{ 


-aVb  ifa<b 
-aAb  ifa>b 


a-  b  = 


{ 


aAb  if  a  <  -b 
a  V  b    if  a  >  -b 


Fusion  is  commutative  (verify  by  eye)  and  associative  (verify  by  checking  case 
by  case),  with  identity  0.  Note  that  a  •  a  =  a,  so  the  logic  satisfies  both  W  and 
M  —  this  is  a  model  for  the  logic  RM  discussed  earlier. 

This  model  can  also  be  extended  by  not  stopping  at  — n  or  n  but  by  includ- 
ing all  of  Z,  the  positive  and  negative  integers.  This  infinite  model  captures 
exactly  the  logic  RM  in  the  language  A,  V,  — >,  o,  ~,  t.  The  infinite  model  has  no 
members  fit  for  either  T  or  _L,  but  they  can  be  added  as  oo  and  — oo  without 
disturbing  the  logic  of  the  model. 

Example  20  (The  Integers)  The  integers  feature  in  the  RM  algebra  above. 
The  choice  of  the  interpretation  of  implication  in  that  model  is  only  one  of 
many  different  ways  you  could  go  in  this  structure.  Another  is  to  consider 
addition  as  a  model  for  fusion.  The  residual  for  addition  is  obvious:  it  is  sub- 
traction. X  — >  Y  is  Y  —  X.  This  structure  is  unlike  the  RM  algebra  in  a  number 
of  ways.  First,  W  fails,  as  a  ^  a  +  a  whenever  a  is  negative.  Second,  M  fails 
(and  so,  K  and  K'  do  too)  as  a  +  a  ^  a  whenever  a  is  positive.  However,  C  and 
B  are  satisfied  in  this  structure,  so  we  have  a  structure  fit  for  linear  logic.  In 
particular,  since  the  structure  is  totally  ordered,  we  have  the  distribution  of 
conjunction  over  disjunction,  so  we  have  a  model  for  distributive linear  logic. 

More  interestingly,  (x  — >  y )  — >  y  —  x  for  each  x  and  y.  This  does  not  hold 
in  any  boolean  algebra  or  in  any  other  non-trivial  structure  with  T.  If  T  were 
present,  then  (a  — >  T)  — >  T  =  a  but  T  <  b  — >  T  for  each  b  (including  a  — >  T) 
so  T  <  a. 

It  is  possible  to  define  the  negation  ~a  as  a  — >  0  =  —a.  However,  other 
choices  are  possible.  Taking  b  an  arbitrary  proposition,  we  can  define  -bias 
a  — >  b,  and  the  condition  (x  — >  y)  — >  y  =  x  states,  in  effect,  that  ~b~bi  =  a. 
Double  negation  introduction  and  elimination  holds  for  any  negation  ~b  we 
choose. 

This  model  is  a  way  to  invalidate  simple  consecutions  in  distributive  linear 
logic.  For  example,  can  we  prove  A  — >  (B  — >  C)  h  (A  — >  B)  — >  (A  — >  C)?  If 
this  holds  in  our  structure  we  must  have  (z  —  y)  —  x  <  (z  —  x)  —  [y  —  x),  but 
this  simplifies  to  z  <  (z  —  x)  +  2x  =  z  +  x  (add  x  +  y  to  both  sides).  And 
when  x  >  0  we  have  z  <  z  +  x,  but  if  x  <  0  this  fails.  Similar  manipulations 
can  be  used  to  invalidate  other  consecutions.  However,  some  consecutions 
invalid  in  distributive  linear  logic  do  hold  in  the  integers.  We  have  seen  that 
(A  — >  B)  — >  B  h  A  already.  Another  case  is  t  h  (A  ->  B)  V  (B  ->  A).  So  the 
integers  do  not  give  an  exact  fit  for  distributive  linear  logic. 

(Others  have  been  aware  that  simple  "counting"  mechanisms  can  provide 
a  useful  filter  for  issues  of  validity  in  substructural  logics  [37,  148,  237,  210] .) 

The  logic  here  is  known  as  abelian  logic:  It  was  introduced  by  Meyer  and 
Slaney  who  show  that  it  is  the  logic  of  ordered  abelian  groups  [173]. 

Example  21  (tu  Under  Division)  Using  number  systems  as  structures  gives 
us  rich  mathematical  tradition  upon  which  we  can  build.  However,  the  struc- 
tures we  have  seen  so  far  are  all  totally  ordered:  for  any  x  and  y  either  x  <  y  or 
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y  <  x.  This  is  not  always  desirable  —  it  leads  to  the  truth  of  (A  — >  B)  V  (B  — > 
A).  Now  some  "natural"  orderings  of  numbers  are  total  orders,  but  others 
are  not.  For  example,  take  the  positive  integers,  ordered  by  divisibility.  This 
is  a  partial  ordering  —  indeed,  a  lattice  ordering  —  in  which  join  is  the  low- 
est common  multiple  and  meet  is  the  greatest  common  divisor.  Fusion  has  a 
natural  model  in  multiplication. 

The  lattice  is  distributive,  as  gcd(a, lcm(b, c))  |  lcm(gcd(a,b),gcd(b,c)). 
With  fusion  modelled  as  multiplication,  1  is  the  identity  of  fusion  and  we  have 
a  distributive  lattice-ordered  commutative  monoid  with  a  unit.  Furthermore, 
the  monoid  is  square-increasing  (as  a  |  a2),  so  it  models  the  structural  rules 
ofR[A,V,o,t]. 

How  can  you  model  a  conditional  residuating  fusion?  We  want 


If  y  divides  z,  then  we  can  set  y  — >  z  to  be  z/y .  For  any  x  you  choose,  xy  |  z  if 
and  only  if  x  |  z/y.  However,  if  y  does  not  divide  z,  we  do  not  have  anything 
to  choose,  as  1  is  the  bottom  of  the  order  we  have  thus  far.  To  get  a  residual 
in  every  instance,  we  need  to  add  another  element  to  the  ordering.  It  will  be 
the  lowest  element  in  the  ordering,  so  we  will  call  it  0  (for  reasons  which  will 
become  more  obvious  later).  We  can  by  fiat  determine  that  0  |  x  for  every  x 
in  the  structure,  and  that  x  |  0  only  when  x  —  0.  Conjunction  and  disjunction 
are  as  before,  with  the  addition  that  0  A  x  =  0  and  0  V  x  =  x  for  each  x.  The 
rule  for  implication  is  then: 


Given  0  we  need  to  extend  the  interpretation  of  fusion.  But  this  is  simple 


for  every  x.  So  defined,  the  operation  is  still  order-preserving,  commuta- 
tive, square-increasing  and  with  1  as  the  identity.  This  structure  is  a  model 
for  R[A,  V,  o,  t,  — >].  To  model  the  whole  of  R  we  need  to  model  a  de  Morgan 
negation.  That  requires  an  order-inverting  involution  on  the  structure.  To  do 
this,  we  need  to  introduce  many  more  elements  in  the  structure,  as  no  order- 
inverting  involution  can  be  found  on  what  we  have  here  before  us:  consider 
the  infinite  ascending  chain 


To  negate  each  element  in  the  series  you  must  get  an  infinite  descending 
chain.  Why?  Because  we  need  an  involution:  x  |  y  if  and  only  if  — y  |  — x, 
and  in  particular,  if  —  x  =  —  y ,  then  we  must  have  x  =  y .  Each  element  in  the 
inverted  chain  must  be  distinct.  Alas,  there  are  no  such  chains  in  our  struc- 
ture, as  every  number  has  only  finitely  many  divisors.  So,  we  need  to  add 
more  elements  to  do  the  job.  As  our  notation  has  suggested,  we  will  add  the 
negative  integers  and  oo.  The  order  is  given  by  setting 


xy  |  z  if  and  only  if  x  |  y  — >  z 


otherwise. 


Ox  =  xO  =  0 


0 |  1  |  2 | 4  |  •  • •  |  2n 


0  |  x  |  -y  |  oo 
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for  every  positive  x  and  y,  and  in  particular,  — x  |  —  y  if  and  only  if  y  |  x.  So 
you  can  read  '|'  as  divides  only  when  it  holds  between  positive  integers.  Oth- 
erwise, it  is  defined  by  these  clauses.  The  infinite  ascending  chain  is  then 
mapped  onto  the  infinite  descending  chain  above  it  as  follows: 

0  |  1  |  2  |  4  |  •  •  •  |  2-  |  •  •  •  |  -2-  |  •  •  •  |  -4  |  -2  |  -1  |  oo 

The  result  is  still  a  distributive  lattice  order,  and  conjunction  and  disjunction 
are  obviously  definable  as  greatest  lower  bound  and  least  upper  bound,  re- 
spectively. Implication  between  all  pairs  of  elements  is  defined  as  follows: 

o  If  x  is  negative  and  y  is  positive,  x  — >  y  =  0. 

o  If  x  is  positive  and  y  is  negative,  then  x  — >  y  =  (— x)y 

o  If  x  and  y  are  both  negative,  then  x  — >  y  =  -y  — >  —  x 
Fusion  is  then  defined  by  setting  xy  =  —  (x  — >  —  y ) ,  and  you  can  show  that  this 
is  commutative,  square-increasing  and  with  1  as  the  identity. 

The  lattice  is  not  complete,  in  that  not  every  subset  has  a  least  upper  or  a 
greatest  lower  bound:  The  chain  0  |  1  2  |  •  •  •  |  2n  |  •  •  •  has  an  upper  bound 
(any  negative  number  will  do)  but  no  least  upper  bound. 

This  structure  was  first  constructed  by  Meyer  [177]  in  1970  who  used  it  to 
establish  some  formal  properties  of  R.  The  technique  of  expanding  an  algebra 
to  model  negation  is  one  we  shall  see  again  as  an  important  technique  in  the 
metatheory  of  these  logics. 

Example  22  (Algebras  of  Relations)  A  generalisation  of  Boolean  algebras 
due  to  de  Morgan  [201]  and  Peirce  [209]  and  later  developed,  for  example, 
by  Schroder  [249],  was  to  consider  algebras  of  binary  relations.  A  concrete  re- 
lation algebra  is  the  set  of  all  subsets  of  some  set  D  x  D  of  pairs  of  elements 
from  a  set  D  under  not  only  the  Boolean  operations  of  intersection,  union  and 
complementation  but  also  under  new  operations  which  exploit  the  relational 
structure. 

For  any  two  relations  R  and  S  their  composition  is  also  a  relation:  R  •  S  is 
defined  by  setting  x(R  •  S)y  if  and  only  if  (3z  e  D)(xRz  AzRy).  This  is  a  model 
for  fusion.  Fusion  has  a  left  and  right  identity,  1,  the  identity  relation  on  D. 
Furthermore,  for  any  relation  R  we  have  its  converse,  given  by  setting  xRy  if 
and  only  if  yRx.  Note  that  (R  •  SJ=  S  •  R. 

We  can  define  left  and  right  residuals  for  composition  directly  by  the  resid- 
uation  conditions,  or  we  can  note  that  they  are  definable  in  terms  of  the 
Boolean  connectives,  fusion  and  converse.  R  — >  S  =  —  (— S  •  R)  and  S  <—  R  = 
-(R--S). 

It  is  possible  to  modify  the  behaviour  of  these  algebras  by  considering  re- 
stricted classes  of  relations.  For  example,  we  could  look  at  algebras  of  reflex- 
ive relations.  These  are  odd,  in  that  1  <  R  for  each  R,  so  the  bottom  element 
of  the  algebra  is  also  the  identity  for  fusion.  These  algebras  are  closed  under 
some  of  the  operations  at  issue,  but  not  all.  The  Boolean  complement  of  a 
reflexive  relation  is  not  reflexive,  but  the  conjunction  or  disjunction  of  two 
reflexive  relations  is. 

Another  possibility  is  to  consider,  for  example,  equivalence  relations  [96] . 
When  is  the  composition  of  two  equivalence  relations  an  equivalence  rela- 
tion? It  turns  out  that  R  •  S  is  also  commutative  when  R  •  S  =  S  •  R.  And  if  this 
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obtains,  then  their  composition  is  the  least  upper  bound  of  the  two  relations 
(in  the  set  of  equivalence  relations) .  Therefore,  a  class  of  commuting  equiva- 
lence relations  forms  a  lattice,  in  which  fusion  is  least  upper  bound.  And  it  is 
not  too  hard  to  show  that  this  lattice  generally  fails  to  be  distributive,  but  it  is 
modular.  It  satisfies  the  modular  law 

QA(bV(aAc))<  (oAb)  V(aAc) 

but  not  the  more  general  distributive  law. 

Tarski  [259]  helped  bring  relation  algebras  back  to  prominence  in  modern 
logic,  and  there  is  much  contemporary  research  in  the  area,  particularly  in 
Hungary  [12].  Vaughan  Pratt  has  also  considered  them  (and  dynamic  alge- 
bras, a  tractable  fragment  of  relation  algebras)  as  a  useful  model  of  computa- 
tion [213]. 

3.1.2   General  Structures 

The  algebraic  study  of  models  of  substructural  logics  was  first  explicitly  and 
comprehensively  tackled  by  J.  Michael  Dunn  in  his  doctoral  dissertation  from 
the  middle  1960s  [75].  The  techniques  he  used  are  mostly  standard  ones, 
adapted  to  the  new  context  of  relevant  logics.  There  had  been  a  long  tra- 
dition of  using  finite  algebras  (also  called  'matrices'  for  obvious  reasons)  to 
prove  syntactic  results  about  logics,  such  as  the  relevance  properly  for  R,  as 
we  have  seen.  Section  22  of  Entailment  Volume  1  [10]  contains  a  good  dis- 
cussion of  results  of  this  sort.  However,  it  was  Dunn's  work  that  first  took  such 
structures  as  a  fit  object  of  study  in  their  own  right. 

For  a  helpful  guide  to  the  state  of  the  art  in  the  1970s,  Helena  Rasiowa's  An 
Algebraic  Approach  to  Non-classical  Logics  [222]  is  a  compendium  of  results  in 
the  field.  Meyer  and  Routley's  groundbreaking  paper  "An  Algebraic  Analysis  of 
Entailment"  [190]  did  a  great  deal  of  work  showing  how  a  whole  host  of  logics 
fit  together,  all  with  the  theme  of  residuation  or  the  connection  of  fusion  with 
implication.  They  showed  that  not  only  in  R  but  also  in  other  relevant  logics, 
fusion  is  connected  together  with  implication  by  the  residuation  postulate 

aob<ciffa<b^c 

and  that  the  natural  way  to  ring  the  changes  in  the  logic  is  to  vary  the  postu- 
lates governing  fusion.  Here  a  summary  of  Dunn's  and  Meyer  and  Roudey's 
work  on  the  general  theory  of  algebras  for  substructural  logics. 

Definition  23  (Posets)  A  poset  (a  partially  ordered  set)  is  a  set  equipped 
with  a  binary  relation  <  which  is  reflexive,  transitive,  and  asymmetric.  That 
is,  a  <  a  for  each  a,  if  a  <  b  and  b  <  c  then  a  <  c,  and  if  both  a  <  b  and 
b  <  a,  then  a  =  b. 

Posets  are  the  basic  structure  of  an  algebra  for  a  logic.  The  order  is  entailment 
between  propositions  in  structure.  Entailment  is  asymmetric  as  we  assume 
that  co-entailing  propositions  are  identical.  This  is  what  makes  propositions 
in  this  kind  of  structure  differ  from  sentences  in  a  formal  language. 

Extensional  conjunction  and  disjunction  enrich  the  poset  into  a  familiar 
algebraic  structure: 
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Definition  24  (Lattices)  A  lattice  is  a  partially  ordered  set  equipped  with 
least  upper  bound  V  and  a  greatest  lower  bound  A. 

A  lattice  is  distributive  if  and  only  ifaA(bVc)  =  (aAb)V(aAc)  holds  for 
each  a,  b  and  c. 

A  lattice  is  bounded  if  it  has  greatest  and  least  elements,  T  and  _L  respectively. 

For  traditional  substructural  logics,  two  more  additions  are  required  to  this 
kind  of  structure.  First,  negation,  and  second,  fusion  and  implication.  Let's 
tackle  negation  first. 

Definition  25  (Negations)  A  negation  on  a  poset  is  an  order  inverting  op- 
eration ~:  that  is,  if  a  <  b  then  ~b  <  -a. 

A  negation  on  a  lattice  is  de  Morgan  — a  =  a  and  it  satisfies  the  de  Morgan 
identities  ~( a  A  b)  =-aV4  and-(aVb)  =  ~aA~b. 

A  de  Morgan  negation  in  a  bounded  lattice  is  an  ortho-negation  if  a  A  ~a  =  _L 
and  aV-a  =  T. 

Note  that  a  de  Morgan  negation  need  not  be  an  ortho-negation.  (The  nega- 
tions in  each  of  the  structures  in  the  previous  section  are  de  Morgan  but  not 
ortho- negation.)  An  ortho-negation  operation  in  a  distributive  lattice  is  the 
Boolean  negation  in  that  structure.71 

Some  very  recent  work  of  Dunn  has  charted  even  more  possibilities  for  the 
behaviour  of  negation.  In  particular,  he  has  shown  that  a  basic  structure  in  a 
substructural  logic  is  a  split  negation  satisfying  the  following  residuation-like 
clauses 

a  <  -biffb  <  -a 

Given  this  equivalence,  both  and  —  are  negations,  and  both  satisfy  some  of 
the  de  Morgan  inequalities  but  not  others  [87]. 72 

The  most  interesting  operations  in  algebras  for  substructural  logics  are 
fusion  and  implication.  The  simplest  way  to  define  them  is  by  residuation. 

Definition  26  (Residuated  Pairs  and  Triples)  (o,— >)  is  a  residuated  pair 
in  a  poset  if  and  only  if  a  o  b  <  c  if  and  only  if  a  <  b  — >  c. 

(o,  — >,  <— )  is  a  residuated  triple  in  a  poset  if  and  only  if  a  o  b  <  c  if  and  only  if 
a  <  b  — >  c  if  and  only  if  b  <  c  <—  a. 

If  (o,— >)  is  a  residuated  pair  then  it  immediately  follows  that  o  is  isotonic  in 
both  places  with  respect  to  the  entailment  ordering.  That  is 

if  a  <  a'  and  b  <  b'  then  a  o  b  <  a'  o  b' 

71There  maybe  more  than  one  ortho-negation  in  a  lattice,  but  there  is  only  one  ortho-negation 
in  a  distributive  lattice. 

72  In  particular,  -■a  A^b  =  ^(aVb)  and  ^aV^b  <  ^(aAb)  (this  latter  inequality  is  satisfied 
by  any  negation)  but  the  converse  can  fail:  -■(a  A  b)  j£  ^aV  -■b.  The  negation  differs  from 
intuitionistic  or  minimal  negation,  however,  by  not  necessarily  satisfying  a  <  -■-■a.  We  do  have, 
however,  a  <  —  a  and  a  <  —  -■a.  For  an  example  of  a  split  negation,  let  —A  be  A  — *  f  and  let 
~A  be  f  <—  A  in  the  Lambek  calculus  extended  with  a  false  constant  f . 
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Implication,  on  the  other  hand,  is  not  isotonic  in  both  places.  It  is  isotonic  in 
the  consequentpla.ee  and  antitonicm  the  antecedent  place.  That  is  we  have 

if  a'  <  aandb  <  b' thenar  b  <  a'  ->  b' 

All  of  this  was  noticed  by  Meyer  and  Routley  in  the  1970s  and  made  rigor- 
ous (and  generalised  to  arbitrary  n-place  operations  and  residuated  families) 
by  Dunn  in  the  1980s  and  1990s  in  his  work  on  gaggle  theory  (from  "ggl"  for 
"Generalised  Galois  Logic":  a  Galois  connection  is  the  general  phenomenon 
of  which  a  residuated  pair  or  triple  is  a  special  case)  [85,  86]. 73 

Tonicity  is  not  the  only  behaviour  of  fusion  and  implication  present  in 
these  models.  If  the  poset  is  a  lattice  ordering,  then  tonicity  generalises  to 
distribution.  It  is  also  an  elementary  consequence  of  the  residuation  clause 
that  fusion  distributes  over  disjunction  in  both  places 

(aVa')ob  =  (aob)  V(a'ob)andao  (bVb'J  =  (aob)V(aob') 

and  implication  distributes  over  the  extensional  connectives  in  a  more  com- 
plicated fashion. 

(QVQ')^b  =  (a^b)A(a'^b)anda^  (bAb')  =  (a  ->  b)  A  (a  ->  b') 

These  sorts  of  structures  are  well  known,  and  they  appear  independently  in 
different  disciplines.  Quantales  [203]  are  but  one  example.  These  are  lattice- 
ordered  semigroups  (so,  o  is  associative)  with  arbitrary  disjunctions  but  only 
finite  conjunctions.  They  appear  in  both  pure  mathematics  and  theoreti- 
cal computer  science.  They  are  discussed  a  little  in  Vickers'  Topology  via 
Logic  [277] ,  which  is  a  useful  source  book  of  other  algebraic  constructions 
and  their  use  in  modelling  processes  and  observation.  The  existence  of  ar- 
bitrary disjunctions  means  that  in  a  quantale,  implication  is  definable  from 
fusion.  If  you  set  a  — >  b  as  follows 

a  — >  b  =  \J{x  :  x  o  a  <  b} 

then  — >  satisfies  the  residuating  condition  for  fusion.74  The  same  definition 
is  possible  in  the  other  direction  too.  If  you  have  a  lattice  with  arbitrary  con- 
junctions (and  implication  distributes  over  conjunction  in  the  right  way)  then 
you  can  define  fusion  from  implication 

aob  =  /\{x  :  a  <  b  — >  x} 

This  definition  is  key  to  one  of  the  important  techniques  in  understanding 
the  behaviour  of  fusion  and  the  connections  between  fusion  and  implication. 
For  fusion  plays  no  part  in  the  Hilbert  systems  introducing  some  substruc- 
tural  logics.  Yet  it  is  present  in  the  Gentzen  systems  (at  least  as  the  comma, 

73 And  I  have  begun  to  sketch  the  obvious  parallels  between  gaggle  theory  and  display  logic. 
Residuation  is  displaying,  and  isotonicity  and  antitonicity  have  connections  to  antecedent  and 
consequent  position  in  the  proof  rules  for  a  connective.  When  you  process  a  fusion,  the  subfor- 
mulas  remain  on  the  same  side  of  the  turnstile  as  the  original  formula.  On  the  other  hand,  when 
you  process  an  implication,  the  antecedent  swaps  sides  and  the  consequent  stays  put. 

74The  distribution  of  o  over  the  infmitary  disjunction  is  essential  here. 
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if  not  explicitly)  and  in  these  algebras.  Does  the  addition  of  fusion  add  any- 
thing new  to  the  system  in  the  language  of  implication?  Or  is  the  addition 
of  fusion  conservative?  In  the  next  section  I  will  sketch  Meyer's  techniques 
for  proving  conservative  extensions  for  many  substructural  logics,  by  way  of 
algebraic  models. 

Before  that,  I  must  say  a  little  about  truth  in  these  algebras.  In  Boolean 
algebras  (for  classical  logic)  and  Heyting  lattices  (for  intuitionistic  logic)  the 
truths  in  a  structure  are  the  formulas  which  are  interpreted  as  the  top  ele- 
ment. There  is  no  need  for  this  to  be  the  case  in  our  structures.  In  the  ab- 
sence of  K',  we  might  have  b  ^  a  — >  a.  That  means  that  a  true  conditional  (as 
every  identity  a  — >  a  is  true)  need  not  be  the  top  element  of  the  ordering.  So, 
instead  of  picking  out  true  propositions  as  those  at  the  top  of  the  ordering, 
substructural  logics  need  be  more  subde. 

Definition  27  (A  Truth  Set)  Given  an  algebra  with  — >,  the  truth  set  T  is  the 
set  of  all  x  where  a  — >  b  <  x  for  some  a,  b  where  a  <  b.75 

The  truth  set  is  the  set  of  all  conditionals  true  on  the  basis  of  logic  alone,  and 
anything  entailed  by  those  conditionals.  A  truth  set  has  some  nice  properties. 

Fact  28  (A  Truth  Set  in  a  Lattice  is  a  Filter)  Any  truth  set!  in  a  lattice  is 
a  filter.  (A  filter  is  a  set  which  is  closed  under  <,  and  closed  under  conjunc- 
tion.76) Ifx,y  e  T  thenx  Ay  e  T.  Ifx  <  x'  thenx'  e  T  too. 

Proof  That  T  is  closed  upwards  is  immediate.  That  T  is  closed  under  con- 
junction, note  that  if  x,y  e  T  then  a  — >  b  <  x  and  a'  — >  b'  <  y  where  a  <  b 
and  a'  <  b'.  ThenaAa'  <  b  Vb',  and  (a  ->  b)  A  (a'  ->  b')  =  aAa'  ->  bVb', 
so  since  (a  — >  b)  A  (a'  — >  b')  <  x  Ay,  we  have  x  Ay  e  T  too.  □ 

If  the  logic  contains  t,  then  the  truth  set  the  filter  generated  by  t:  T  =  {x  :  t  < 
x}.  The  presence  of  truth  sets  in  models  shows  that  a  logic  without  t  can  be 
conservatively  extended  by  it.  Both  conservative  extension  constructions  — 
due  to  Meyer  in  the  1970s,  are  the  topic  of  the  next  section. 

3.1.3    Conservative  Extension  Theorems 

Meyer's  conservative  extension  results  follow  the  one  technique  [179].  Sup- 
pose we  have  consecution  invalid  in  a  logic  with  a  restricted  language  A  1/  B. 
Then  (by  the  soundness  and  completeness  results  for  propositional  struc- 
tures) there  is  an  algebra  A  and  an  interpretation  [•]  into  A  where  [A]  j£  [B]. 
Then,  we  manipulate  A  into  a  new  structure  A',  appropriate  for  the  larger 
language,  and  in  which  we  have  a  new  interpretation  which  is  still  a  coun- 
terexample to  [Al  j£  [B].  There  are  two  separate  techniques  Meyer  pioneered. 
One,  injecting  a  structure  A  into  it's  completion  A*  (giving  us  a  way  to  inter- 
pret t,  o,  and  conjunction  and  disjunction  if  those  are  not  present),  and  then, 
taking  a  structure  A  and  pasting  on  an  inverted  duplicate  A°v,  in  order  to 
model  negation. 

75Equivalently,  it  is  the  set  generated  by  all  identities  q-)q,  since  a— >  a  <  a  — >  b  if  a  <  b. 
76It  is  the  algebraic  analogue  of  a  theory,  which  we  have  already  seen. 
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Example  29  (Mapping  A  into  A*)  The  map  from  a  propositional  structure 
into  its  completion  is  given  in  the  following  way.  A*  is  denned  in  a  number  of 
alternative  ways. 

o  If  ^4  is  only  a  poset  or  a  semilattice  (with  A  but  not  V) ,  then  A*  is  the  set  of 
all  downwardly  closed  sets  of  A.  That  is,  I  is  an  element  of  A*  if  and  only 
if  whenever  a  e  I  and  b  <  a  then  be  I. 
o  If  A  contains  disjunction,  then  A*  is  the  set  of  all  ideals  in  A.  (Ideals  are 
dual  to  filters.  They  are  closed  downward,  and  closed  under  disjunction: 
if  a,b  e  IthenaVb  e  I.) 
o  If  A  contains  disjunction  and  _L,  then  A*  is  the  set  of  nonempty  ideals  in 
A  —  every  ideal  must  contain  _L,  the  least  element  of  A. 
A*  is  a  complete  lattice  —  order  is  subsethood,  the  conjunction  of  a  class 
of  elements  is  their  intersection,  and  the  disjunction  of  a  class  of  elements 
is  the  intersection  of  all  elements  above  each  element  in  that  class.  It  is  not 
difficult  to  show  that  it  completely  distributive  if  the  original  lattice  contains 
no  counterexample  to  distributivity.  If  fusion  is  present  in  A  then  it  is  present 
in  .4*  too. 

IoJ={z:3xe  I,y  e  J(z<xoy)} 

and  other  connectives  lift  in  a  similar  way.  The  structural  rules  of  A  are  pre- 
served in  A*  77  This  shows  that  A*  has  the  nice  logical  properties  of  A. 

However,  since  A*  is  a  complete  lattice,  we  can  do  interesting  things  with 
it.  If  A  doesn't  contain  a  truth  element  t  as  a  left  identity  for  fusion,  A*  still 
does.  Since  A*  is  complete,  you  can  set  t  to  be  /\{I  — >  J  :  I  <  J}.  Then  to  I  <  J 
if  and  only  if  t  <  I  — >  J  if  and  only  if  I  <  J,  and  so,  t  is  a  left  identity  for  fusion. 

Furthermore,  the  map  from  A  to  A*  which  sends  a  to  J, a  =  {x  :  x  <  a} 
injects  one  structure  into  the  other,  preserving  all  of  the  operations  in  A.  Any 
consecution  with  a  counterexample  in  A  will  have  a  counterexample  in  A* 
too.  It  shows  that  linear  logic  without  the  additives  is  conservatively  extended 
by  additives  which  distribute,  for  example. 

Example  30  (Pasting  A  and  A°v  together)  Modifying  a  structure  in  such 
a  way  as  to  add  negation  is  more  difficult.  To  add  a  de  Morgan  negation  to 
a  structure,  we  need  an  upside  down  copy  A°v  of  A  so  that  for  negation  can 
be  an  order  inverting  map  of  period  two.  Following  the  details  of  this  con- 
struction will  be  a  great  deal  easier  if  we  take  A  to  include  top  and  bottom 
elements,  so  from  now  I  will  do  so. 

Conjunction  and  disjunction  in  A°v  can  be  defined  as  the  de  Morgan  dual 
of  that  in  A.  So,  if  a,  b  e  A°v  then  aAb  =  —(—a  V  — b),  where  —  is  the  map 
from  A  to  A°v  and  back.  Defining  conjunctions  and  disjunctions  of  elements 
between  A  and  A°v  depends  on  another  decision  we  need  take.  If  a  e  A 
and  b  e  A°p,  then  we  need  decide  on  what  we  take  a  A  b  and  a  V  b  to  be. 
There  are  three  options  for  this,  each  depending  on  the  relevant  positioning 
of  A  and  A°v .  Meyer's  original  choice  [179]  was  to  put  A°v  above  A.  Then  the 
disjunction  of  an  element  form  A  with  an  element  from  A°v  will  be  the  ele- 
ment from  A°v  and  the  conjunction  will  be  the  element  from  A.  This  choice 
(rather  than  putting  A°v  under  A)  is  the  one  to  take  if  you  wish  to  end  up  with 
a  model  for  the  relevant  logic  R,  for  we  wish  to  end  up  with  t  h  A  V  —A.  The 

77The  proof  is  tedious  but  straightforward  [234,  Chapter  9] . 
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element  t  is  in  A,  and  we  wish  it  to  be  under  each  a  V  —a.  But  a  V  —a  can 
be  any  element  in  the  top  half  of  the  model,  so  t  must  be  under  each  element 
in  the  top  half,  so  it  is  either  the  bottom  element  of  the  top  half  of  the  model 
(not  likely,  if  any  conditional  is  untrue  at  all  in  the  original  model)  or  it  is  in 
the  bottom  half. 

The  other  choice  for  ordering  the  two  components  —  putting  A°v  below 
A  —  is  required  if  you  wish  the  original  model  to  satisfy  K.  Then,  t  must  be  T, 
and  since  t  is  in  the  original  model,  it  must  be  at  the  top  of  the  new  model,  so 
A°v  can  go  underneath. 

There  is  one  other  natural  choice  for  the  ordering  of  A  and  A°v ,  and  that  is 
to  take  them  in  parallel.  You  can  paste  together  the  top  elements  both  mod- 
els and  the  bottom  elements  of  both  models  (or  add  new  top  and  bottom  el- 
ements if  you  prefer)  and  then  take  the  disjunction  of  a  pair,  one  from  A  and 
A°v ,  to  be  the  T  element  of  the  whole  structure,  and  the  conjunction  of  that 
pair  to  be  the  _L  element.  This  in  another  natural  option,  but  the  resulting 
lattice  is  not  distributive  if  A  is  not  trivial. 

To  make  the  resulting  structure  a  model  for  a  logic,  you  must  define  o  and 
— >  in  the  whole  structure.  Most  choices  are  fixed  in  advance,  if  fusion  is  com- 
mutative in  A.  Since  we  want  a  o  b  to  be  -(a  — >  -b),  and  a  — >  b  to  be 
— b  — >  —a,  we  take  nob  when  a  e  A  and  b  e  A°v  to  be  —(a  — >  — b)  (and 
the  dual  choice  when  a  e  A°v  and  bei).  The  remaining  choice  is  for  a  o  b 
where  a,  b  e  A°v .  Here,  it  depends  on  the  relative  position  of  A  and  A°v .  If 
we  add  the  new  structure  on  top,  take  ao  b  to  be  T.  If  we  add  the  new  structure 
below,  or  alongside,  take  ao  b  to  be  _L.  The  new  structure  satisfies  many  of  the 
structural  rules  of  the  old  structure,  and  as  a  result,  a  conservative  extension 
result  for  logics  in  the  vicinity  of  R  follows  [177,  179,  234]. 

There  is  one  substructural  logic  for  which  a  conservative  extension  by 
negation  fails:  RM.  RM  is  given  by  extending  R  with  the  mingle  rule  Ao  A  — >  A 
(or  equivalently,  A  h  A  — >  A).  If  you  add  mingle  to  positive  R  then  the  re- 
sult is  still  a  sublogic  of  intuitionistic  logic,  and  as  a  result,  total  ordering 
th  (A  ->  B)  V  (B  ->  A)  is  not  provable.  This  logic  is  called  RMO.  In  the 
presence  of  negation,  however,  the  addition  of  mingle  brings  along  with  it- 
self the  total  ordering  principle.  (This  result  is  due  to  Meyer  and  Parks,  from 
1972  [188].) 

3.2  Categories 

In  propositional  structures,  we  abstract  away  from  the  particulars  of  the  lan- 
guages in  which  our  propositions  are  expressed  to  focus  on  the  propositions 
themselves,  ordered  under  entailment.  In  propositional  structures,  propo- 
sitions are  first-class  citizens,  and  proofs  between  propositions  fade  into  the 
background.  If  there  is  a  proof  from  A  to  B,  then  [A]  <  [B].  The  differences 
between  proofs  from  A  to  B  are  not  registered  in  this  algebraic  semantics. 

Models  do  not  have  to  be  like  this.  We  can  consider  not  only  propositions 
as  objects  but  also  proofs  as  "arrows"  between  objects.  If  we  have  one  proof 
from  A  to  B,  we  might  indicate  this  as  'f  :  A  — «-  B,'  where  f  is  the  proof.  We 
might  have  another  proof  g  :  B  — ►  C,  and  then  we  could  compose  them  to 
construct  another  proof  gf  :  A  — ►  C,  which  runs  though  f  and  then  g. 
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Logicians  did  not  have  to  go  to  the  trouble  of  inventing  structures  like  this. 
It  turns  out  that  mathematical  objects  with  just  these  properties  have  been 
widely  studied  for  many  decades.  Categories  are  important  mathematical 
structures.  Category  theory  is  a  helpful  language  for  describing  constructions 
which  appear  in  disparate  parts  of  mathematics.  This  means  that  category 
theory  is,  by  its  nature,  very  abstract.  This  also  means  that  category  theory 
is  rich  in  examples,  interesting  categories  are  models  of  substructural  logics. 
In  particular,  I  will  look  at  one  example  categorical  model  of  a  logic,  Girard's 
model  of  coherence  spaces,  for  linear  logic.78 

To  understand  the  role  of  categories  as  models  of  logic,  you  need  to  focus 
on  one  particular  part  of  categorical  technology:  the  adjoint  pair.  An  adjoint 
pair  is  a  relationship  between  two  functors,  and  functors  are  structure  pre- 
serving maps  between  categories.  Thinking  of  a  category  as  a  model  of  a  logic 
generalising  an  algebra,  the  operators  such  as  fusion,  implication  and  so  on 
are  all  functors  from  the  category  to  itself  (or  perhaps,  from  the  category  to 
its  opposite,  which  is  found  by  swapping  arrows  from  a  to  b  to  go  from  b  to  a 
instead).  Operators  like  fusion,  which  are  isotonic,  are  really  two-place  maps 
from  a  category  to  itself,  not  only  sending  a  pair  of  category  objects  to  another 
object  (their  fusion)  but  also  sending  arrows  f  :  a  — ►  a'  and  g  :  b  — ►  b'  to 
an  arrow  f  o  g  :  a  o  b  — ►  a  o  b '. 

Definition  31  (Adjoint  Pairs)  If  F  :  C  — -  V  and  G  :  V  — -  V,  and  there  is 
a  bijection  cp  :  Homc(X,F(Y))  =  HomD(G(X),  Y),  natural  in  both  X  and  Y,79 
then  we  say  that  (F,  G,  cp)  is  an  adjunction.  F  is  the  left  adjoint  and  G  is  the 
right  adjoint. 

Example  32  (Adjunction  between  Fusion  and  Implication)  In  cartesian 
closed  categories,60  product:  —  x  B  is  a  functor  C  — -  C.  Similarly  [B  =4>  — ] 
is  a  functor  C  — >-  C.  These  functors  form  an  adjunction.  If  f  :  A  x  B  — «-  C, 
then  Af  :  A  — ►  [B  =#>  C].  Conversely,  if  g  :  A  — ►  [B  — >  C],  then  ev(g  x  idB)  : 
Ax  B  — ►  C.  This  is  a  bijection  Hom(  A  x  B,C)  =  Hom(A,  [B  C]). 

This  is  the  categorical  equivalent  of  the  residuation  between  extensional  con- 
junction and  intuitionistic  implication.  Cartesian  closed  categories  are  mod- 
els of  intuitionistic  logic  [152]. 

3.2. 1    Coherence  Spaces 

Coherence  spaces  arise  as  a  model  of  the  A-calculus,  and  intuitionistic  logic. 
They  provided  the  first  model  which  gave  Girard  an  insight  into  the  decom- 
position of  intuitionistic  implication  in  terms  of  linear  implication  and  the 
exponential !  [117,  121]. 

78In  a  history  like  this  I  can  only  assume  some  category  theory,  and  not  introduce  it  myself. 
Here  are  some  standard  references:  Mac  Lane's  Categories  for  the  Working  Mathematician  is  a 
very  good  introduction  to  the  area  [158],  very  readable,  even  by  those  who  are  not  working  math- 
ematicians. Barr  and  Wells'  Category  Theory  for  Computing  Science  is  also  clear,  from  a  perspec- 
tive of  the  theory  of  computation  [17].  Chapter  10  of  An  Introduction  to  Substructural  Logics  [234] 
contains  just  the  category  theory  you  need  to  go  through  the  detail  of  this  model.  Dosen's  paper 
"Deductive  Completeness"  is  a  clear  introduction  focussing  on  the  use  of  categories  in  logic  [72] . 

79I  can't  tell  you  what  it  is  for  a  bijection  to  be  natural,  unfortunately.  However,  Homc  (X,  Y)  is 
the  class  of  all  arrows  from  X  to  Y  in  the  category. 

80I  can't  tell  you  what  these  are,  for  lack  of  space. 
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Definition  33  (Coherence  Spaces)  A  coherence  space  is  a  set  A  of  sets,  sat- 
isfying the  following  two  conditions. 

o  If  a  e  A  and  b  C  a  then  b  e  A,  and 

o  If  for  each  x,y  e  a,{x,y}  e  A,  then  a  e  A. 
But  coherence  spaces  are  much  better  thought  of  as  undirected  graphs.  We 
say  a  coheres  with  b  (in  A)  if{x,y}  e  A.  We  write  this:  xCy  (mod  A).'  The  co- 
herence relation  determines  the  coherence  space  completely  Coherent  sets 
(a  e  A)  are  cliques  in  the  graph.  The  coherence  relation  is  reflexive  and  sym- 
metric, but  not,  in  general,  transitive.81 

Given  a  coherence  space  A,  we  define  coherence  relations  as  follows: 
o  x^y  (mod  A)  iffxOy  (mod  A)  andx  ^  y. 
o  xwy  (mod  A)  iff{x,y}  ^  A. 

o  xXy  (mod  A)  iff  it  is  not  the  case  that  x^y  (mod  A). 

Definition  34  (Product,  Sum  and  Negation  Spaces)  Given  spaces  A  and 
B,  the  coherence  spaces  AAB  and  AV  B  are  defined  on  the  disjoint  union  of 
the  points  x  of  the  graph  of  A  and  y  of  the  graph  of  B,  as  follows: 

(0,x)C(0,x;)  (mod  AAB)    iffxQx'  (mod  A) 
0,y)O(1,y')  (mod  AAB)    iffyOy'  (mod  B) 
(0,x)O(1,y)  (mod  AAB)  always 

(0,x)O(0,x')  (mod  AVB)  iffxQx'  (mod  A) 
(1,y)0(1,y')  (mod  AVB)  iffyOy'  (mod  B) 
(0,x)O(1,y)  (modAVB)  never 

Given  a  coherence  space  A  the  coherence  space  ~A  is  defined  by  setting  xCy 
(mod  ~A)  if  and  only  if  xXy  (mod  .A).  Note  that — A  =  A.  Sgl  =  {0,{*}},  the 
one-point  coherence  space.  Emp  =  {0},  the  empty  coherence  space.  Note  that 
~Sgl  =  Sgl  and~Emp  =  Emp. 

Note  that  here  ~~[A  AB)  =-/(V  and -(A  V  B)  =  ~A  A  ~-B.  Furthermore, 
.4,AEmp  =  EmpAA  =  A  =  AVEmp  =  EmpV  A.  Emp  does  the  job  of  both  T 
and  _L  in  the  category  of  coherence  spaces.82 

The  class  of  all  coherence  spaces  can  be  made  into  a  cartesian  closed  cat- 
egory, if  we  take  the  arrows  to  be  continuous  functions. 

Definition  35  (Continuous  Functions)  F  :  A  — -  B  is  continuous  if  and 
only  if 

o  If  a  C  bthenF(a)  C  F(b). 

o  If  S  C  A  is  directed  (that  is,  if  a,  b  e  S,  then  aUbeS  too)  then  F((J  S)  = 
LKF(a):aeS}. 


81Erhard's  hypercoherences  are  a  generalisation  of  coherence  spaces  which  are  richer  than  a 
graph  represents  [95].  In  hypercoherences,  a  might  be  a  coherent  set  without  a '  C  a  also  being 
coherent.  The  category  of  hypercoherences  is  also  a  model  of  linear  logic. 

82This  shows  how  categories  have  a  kind  of  flexibility  unavailable  to  posets.  In  a  poset,  T  =  _L 
only  if  the  poset  is  trivial.  In  a  category,  T  and  _L  might  be  identical  or  isomorphic,  without  the 
category  structure  being  trivial.  Yes,  there  will  be  arrows  from  every  object  to  every  other  object, 
but  it  is  not  the  case  that  all  objects  are  isomorphic. 
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Fact  36  (Minimal  Representatives)  IfT:A  — -  B  is  continuous,  and  if  a  e 
A  andy  e  F(a),  then  there  is  a  minimal  finitea'  e  A  wherey  e  F(a'). 

Proof  Ify  e  F(a)theny  e  F(a*)  for  some  finite  a*.  Pick  some  smallest 
subset  a'  of  a*  with  this  property.  (This  is  possible,  as  a*  is  finite.)  □ 

We  want  to  construct  a  coherence  space  representing  F  :  A  — -  B.  We  start  by 
defining  the  trace  of  a  function. 

Trace(F)  =  {(a,y)  e  Aqu  x  \B\  :  y  e  F(a)  and  a  is  minimal} 

Note  thatTrace(F)  C  Aqu  x  \B\  has  the  following  properties. 

o  If  (a,y),  (a,y')  G  Trace(F)  thenyOy'  (mod  6). 

o  If  a'  C  a,  (a,y),  (a',y)  e  Trace(F),  then  a  =  a'. 
Conversely,  if  £  is  any  set  with  these  two  properties,  then  define  Fs  by  setting 

hW  =  {y  G  |B|  :  3a'  C  a  where  (a',y)  e  3} 

We  can  represent  continuous  functions  by  their  traces.  In  fact,  if  F  is  con- 
tinuous, then  F  =  Fjraceu)-  Can  we  define  a  coherence  relation  on  traces? 
Consider  the  special  case  where  there  are  two  minimal  representatives,  that 
is,  (a,y),(a',y)  e  Trace(F).  Under  what  circumstances  are  they  coherent? 
Unfortunately  we  need  more  information  in  order  to  define  a  coherence  re- 
lation —  we  need  a  relationship  between  a  and  a'.  We  can  show  that  in  a 
particular  class  of  continuous  functions,  there  is  always  a  unique  minimal  a. 

Definition  37  (Stable  Functions)  F  :  A  — ►  B  is  stable  if  it  is  continuous, 
and  in  addition,  whenever  a,  a',  aU  a'  e  A,  then  F(a  n  a')  =  F(a)  n  F(a'). 

With  stable  functions,  we  can  choose  a  unique  minimal  representative  a. 

Fact  38  (Unique  Minimal  Representatives)  F  :  A  — -  B  is  stable  if  and 
only  if  for  each  a  e  A,  wherey  e  F(a),  there  is  a  unique  minimal  a'  e  Afln 
such  thaty  e  F(a'). 

Proof  For  left  to  right,  it  is  straightforward  to  check  that  a'  =  f|{Q*  €  A  : 
a*  C  a,  wherey  e  F(a*)}  is  the  required  a'.  For  right  to  left,  monotonicity 
tells  us  that  F(an  a')  C  F(a)  andF(an  a')  C  F(a'),  so  F(an  a')  C  F(a)nF(a'). 
Conversely,  if  a,  a',  aUa'ei,  then  if  y  e  F(a)  and  y  e  F(a'),  then  y  is  in 
F(a")  for  a  unique  minimal  a".  Therefore  a"  C  a  and  a"  C  a',  so  a"  C  an  a', 
and  hence  y  e  F(a")  C  F(a  n  a'),  as  desired.  □ 

The  next  result  is  simple  to  verify. 

Fact  39  (Characterising  Stable  Functions)  If  F  is  stable,  then  whenever 
(a.yUa'.y')  e  Trace(F) 

o  If  aU  a'  e  A  thenyO-y'  (mod  B). 

o  If  qU  a'  ^  A  theny  =y'  (mod  B). 
Conversely,  if  the  set$  satisfies  these  conditions,  then  Fy  is  stable.  □ 

Given  this,  we  can  define  A  z>  B.  \A  z>  B\  =  Afinx\B\  as  follows:  (a,y)C(a',y ') 
(mod  Ad  B)  if  and  only  if 

o  If  aU  a'  e  ylthenyCy'  (mod  B). 
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o  If  aU  a'  e  .4  then  y  =  y'  (mod  B). 
That  is,  A  D  B  —  {Trace(F)  |  F  :  A  — -  B  is  stable}.  The  category  of  coherent 

spaces  and  stable  functions  between  them  is  cartesian  closed.  This  construc- 
tion is  obviously  a  two-stage  process.  It  begs  to  be  decomposed.  We  should 
define  a  coherence  space !  A  on  the  set  of  finite  coherent  sets  of  _4fin  as  follows: 

aCa'  (mod  !A)  iff  a  U  a'  e  A 

and  define  linear  implication  A  — >  B  by  setting  (x,y)C(x' ,y')  (mod  A  — >  B) 
if  and  only  if 

o  IfxQx'  (mod  A)  then yOy'  (mod  B). 
o  IfxQx'  (mod  .4)  andy  =  y '  thenx  =  x'. 

Note  that  A  — >  B  is  (isomorphic  to)  ~Z?  — >  ~A  Furthermore,  Sgl  — >  .4  is 
(isomorphic  to)  .4  and  .4  — >  Sgl  is  (isomorphic  to)  ~A.  The  operation  z>  stands 
to  stable  functions  as  — > stands  to  a  new  kind  of  function:  the  linear  functions. 


Definition  40  (Linear  Maps)  F  is  a  linear  map  if  and  only  if  whenever  A  c 
A  is  linked  (that  is,  if  a,  b  e  A  then  a  U  b  e  A)  then  F((J  A)  =  \J{T(a)  :  a  e  A}. 

If  F  is  linear  then  F  is  stable  (this  is  straightforward)  and  in  addition,  if  x  e  F(a) 
then  the  minimal  b  where  x  e  F(b)  is  a  singleton.  It  follows  that  the  trace  of  F 
can  be  simplified.  The  linear  trace  of  a  linear  map  F  is  defined  as  follows: 

Trlin(F)={(x,y):y  e  F({x})} 

Therefore,  A  ->  B  =  {Trlin(F)  F  :  A  — ►  B  is  linear}. 

Given  — >,  we  can  see  that  it  is  connected  by  an  adjunction  to  a  natural 
fusion  operation.  We  can  define  A  o  B  as  follows:  |.4  o  B|  =  \A\  x  and 
(x,y)C(x',y')  if  and  only  if  xQx'  (mod  ^4)  andyCy '  (mod  B). 

Fact  41  (The  Adjunction  between  Fusion  and  Implication)  In  the  cate- 
gory of  coherence  spaces  with  linear  maps 

Hom(A  oB,C)=  Hom[A,  B  ->  C) 

is  an  adjunction  for  all  A,  B  andC. 

This,  with  the  associativity  and  commutativity  of  o,  together  with  the  be- 
haviour of  !,  shows  that  the  category  of  coherence  spaces  and  linear  maps 
is  a  model  of  linear  logic. 

Some  very  recent  work  (2001)  of  Schalk  and  de  Paiva's  on  poset-valued 
sets  [248]  generalises  coherence  spaces  in  an  interesting  direction.  They  show 
that  coherence  spaces  and  hypercoherences  can  be  seen  as  maps  from  Set  x 
Set  to  the  algebra  RM3.83  If  x^y  then  f [x,y]  =  t,  if  x^y  then  f(x,y)  =  f,  if 
x  =  y  then  f(x,y)  =  b.  The  logical  operators  of  negation,  fusion  and  impli- 
cation then  lift  from  the  algebra  to  the  coherence  spaces.  (In  other  words,  if 
A  :  Set  x  Set  — >  RM3  is  a  coherence  space,  then  ~A  is  the  map  composing  A 

83They  do  not  recognise  that  the  algebra  is  already  quite  studied  in  the  relevant  logic  literature. 


Greg  Restall,  Greg . Restaliamq .  edu .  au 


June  23,  2001 


http: //www. phil .mq.edu. au/staf f /grestall/ 


68 


with  ~  :  RM3  — >  RM3.  The  same  goes  for  the  other  operations.)  Different  cat- 
egorical models  in  the  style  of  coherence  spaces  can  then  be  given  by  varying 
the  target  algebra.  I  suspect  that  using  some  of  the  algebras  known  in  the  sub- 
structural  literature  will  lead  to  interesting  categorical  models  of  linear  logic 
and  related  systems. 

Girard  has  shown  that  Banach  spaces  can  be  used  in  place  of  coherent 
spaces  to  model  linear  logic  [120].  The  norm  in  a  Banach  space  takes  the 
place  of  the  coherence  relation.  As  we  shall  see  later,  it  is  not  the  only  point  at 
which  geometric  intuitions  have  come  to  play  a  role  in  substructural  logic. 

3.3  Frames 

The  study  of  modal  logic  found  new  depth  and  vigour  with  the  advent  of  pos- 
sible worlds  semantics.  As  we  have  seen,  algebras  are  useful  models  of  sub- 
structural  logics.  However,  they  are  so  close  to  the  proof  theory  of  these  logics 
that  they  do  not  provide  a  great  deal  of  new  information,  either  about  the  in- 
trinsic properties  of  the  logic  in  question,  or  about  how  it  is  to  be  applied. 
Models  in  terms  of  frames  are  one  way  to  extract  more  information.  Perhaps 
this  is  because  frames  are  a  further  step  removed  from  the  logic  in  an  im- 
portant sense.  In  algebras,  each  formula  in  the  language  is  interpreted  as  an 
element  in  the  algebra.  In  frames,  each  formula  is  not  interpreted  as  an  ele- 
ment in  the  frame  —  the  elements  in  the  frame  lie  underneath  the  interpre- 
tation of  formulas.  Formulas  are  interpreted  as  collections  of  frame  elements. 
Therefore  the  interpretations  of  connectives  on  a  frame  are  themselves  de- 
composed. They  are  no  longer  simply  functions  on  algebras  satisfying  spec- 
ified conditions.  Their  action  on  sets  of  frame  elements  is  factored  through 
their  action  on  individual  frame  elements.  As  a  result,  frame  interpretations 
of  logics  can  be  thought  to  carry  more  information  than  algebras.  In  addi- 
tion, frame  semantics  is  suggestive  of  applications  of  logics.  Just  as  the  idea 
of  the  interpretation  of  a  proposition  as  a  set  of  possible  worlds,  or  a  set  of 
times  or  a  set  of  locations  has  driven  the  application  of  different  models  of 
modal  or  temporal  logics,  so  the  interpretation  of  frame  semantics  for  sub- 
structural  logics  has  led  to  their  use  in  diverse  applications.  But  enough  of 
scene-setting.  Let's  start  with  a  the  first  attempts  to  give  precise  frame  se- 
mantics for  substructural  logics.  As  before,  our  story  starts  with  the  relevant 
logic  R. 

3.3.1    Operational  Frames 

The  idea  of  frame  semantics  for  relevant  logics  occurred  independently  to 
Routley  and  to  Urquhart  in  the  late  1960s  and  early  1970s.  Routley's  tech- 
niques are  more  general  than  Urquhart's,  but  Urquhart's  were  published  first, 
and  are  the  simplest  to  introduce,  so  we  will  start  with  them. 

Consider  the  constraints  for  developing  a  frame  semantics  for  a  relevant 
logic.  The  bare  bones  of  any  frame  semantics  are  as  follows.  A  frame  is  a  set 
of  objects  (call  them  points,  though  "worlds",  "situations",  "set-ups"  and  other 
names  have  all  been  used),  and  a  model  on  that  frame  is  a  relation  Ih  which 
indicates  what  formulas  are  true  at  what  points.  We  read  "x  Ih  A"  as  "A  is 
true  at  x."  Typically,  the  relation  Ih  is  constrained  by  inductive  clauses  that 
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indicate  how  the  truth  (or  otherwise)  of  a  complex  formula  at  each  point  is 
determined  by  the  truth  (or  otherwise)  of  its  subformulas.  Given  a  particular 
model,  then,  we  say  that  A  entails  B  on  that  model  if  and  only  if  for  every 
point  x,  if  x  I  h  A  then  x  lh  B.  Entailment  is  preservation  of  truth  at  all  points 
in  a  model. 

This  is  the  bare  bones  of  a  frame  semantics  for  a  logic.  Consider  how  this 
determines  what  we  can  do  to  interpret  relevant  implication.  It  is  axiomatic 
for  a  relevant  logic  that  the  entailment  from  A  to  B  — >  B  can  fail.  In  frame 
terms  this  means  that  we  must  have  points  in  our  models  in  which  B  — >  B 
can  fail.  This  means  that  the  interpretation  of  implication  must  differ  from 
any  kind  of  frame  interpretation  of  conditionals  seen.  For  a  strict  conditional 
A  ^  B  to  be  true  at  a  world,  we  need  to  check  all  accessible  worlds,  to  see  if 
B  is  true  whenever  A  is  true.  As  a  result,  B  B  is  true  at  every  world.  Sim- 
ilarly, for  counterfactual  conditionals  A  CR  B,  we  check  the  nearby  worlds 
where  A  is  true,  to  see  if  B  is  true  there  too.  Again,  B  CR  B  is  true,  because 
we  check  the  consequent  at  the  very  same  points  in  the  model  where  we  have 
taken  the  antecedent  to  be  true.  Something  different  must  be  done  for  a  rel- 
evant conditional.  At  the  very  least  we  need  to  check  the  value  of  the  con- 
sequent somewhere  at  places  other  than  simply  where  we  have  checked  the 
antecedent. 

Urquhart's  innovation  was  a  natural  way  to  do  just  this  [265, 266,  267,  268] . 
Consider  again  what  an  implication  A  — >  B  says.  To  be  committed  to  A  — >  B 
is  to  be  committed  to  B  whenever  we  gain  the  information  that  A.  To  put  it 
another  way,  a  body  of  information  warrants  A  — >  B  if  and  only  if  whenever 
you  update  that  information  with  new  information  which  warrants  A,  the  re- 
sulting (perhaps  new)  body  of  information  warrants  B.  Putting  this  idea  in 
technical  garb,  we  get  a  familiar-looking  inductive  clause  from  a  frame  se- 
mantics: 

o  x  lh  A  — >  B  if  and  only  if  for  eachy,  if  y  lh  A  thenx  U  y  lh  B. 

But  this  inductive  clause  has  a  new  twist.  Unlike  the  clauses  for  strict  or 
counterfactual  conditionals,  in  this  clause  we  check  the  consequent  and  an- 
tecedent at  different  points  in  the  model  structure.  The  way  is  open  for  B  — >  B 
to  fail. 

Let's  take  some  time  to  examine  the  detail  of  this  clause.  We  have  a  class 
of  points  (over  which  "x"  and  "y"  vary),  and  a  function  U  which  gives  us  new 
points  from  old.  The  point  x  U  y  is  supposed,  on  Urquhart's  interpretation, 
to  be  the  body  of  information  given  by  combining  x  with  y.  The  properties 
we  take  combination  to  have  will  influence  the  properties  of  the  conditional. 
First  up,  let's  consider  our  old  enemy,  A  h  B  — >  B.  For  this  to  fail,  we  need 
have  a  point  x  where  x  \f  B  — >  B,  and  for  this,  we  need  just  some  y  where 
y  lh  B  and  x  U  y  \f  B.  This  means  that  combination  of  bodies  of  information 
cannot  satisfy  this  hereditary  condition: 

o    If  x  lh  A  thenx  U  y  lh  A     left  hereditary  condition 

Similarly,  if  we  are  to  have  A  h  B  — >  A  to  fail,  then  combination  cannot  satisfy 
the  dual  hereditary  condition. 

o    Ifx  lh  A  then  y  Ux  lh  A     right  hereditary  condition 
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This  means  that  combination  is  sometimes  nonmonotonic  in  a  natural  sense. 
Sometimes  when  a  body  of  information  is  combined  with  another  body  of 
information,  some  of  the  original  body  of  information  might  be  lost.  This  is 
simplest  to  see  in  the  case  motivating  the  failure  of  A  h  B  — >  A.  A  body  of 
information  might  tell  us  that  A.  However,  when  we  combine  it  with  some- 
thing which  tells  us  B,  we  the  resulting  body  of  information  might  no  longer 
warrant  A  (as  A  might  conflict  with  B).  Combination  might  not  simply  result 
in  the  addition  of  information.  It  may  well  warrant  its  revision. 

To  model  the  logic  R,  combination  must  satisfy  a  number  of  properties: 

o    xUy—yLix  commutativity 
o    (x  U  i|)  U  z  =  x  U  (y  U  z)  associativity 
o    xUx  =  x  idempotence 

Commutativity  gives  us  assertion,  associativity  gives  us  prefixing  and  suffix- 
ing, and  idempotence  gives  us  contraction,  as  is  easily  verified.  For  example, 
consider  assertion:  to  verify  that  A  h  (A  — >  B)  — >  B,  suppose  that  x  lh  A.  To 
show  that  x  lh  ( A  — >  B )  — >  B,  take  a  y  where  y  lh  A  — >  B.  We  wish  to  show  that 
x  U  y  lh  B.  By  commutativity,  xUy  =  y  U  x,  and  since  y  lh  A  — >  B  and  x  lh  A, 
we  can  apply  the  conditional  clause  at  y  to  give  yUxlh  B.  So,  xUy  lh  Bas 
desired. 

These  frame  properties,  governing  the  behaviour  of  U  are  very  similar  in 
scope  to  the  structural  rules  governing  fusion  and  intensional  combination 
in  different  proof  theories.  This  is  no  surprise,  as  U  is  the  frame  analogue  of 
fusion.  It  comes  as  no  surprise,  then,  that  as  you  vary  conditions  on  U  you 
can  model  different  substructural  logics. 

Keeping  the  analogy  afloat,  then,  we  can  see  how  these  models  might  in- 
terpret theoremhood  in  our  logics.  In  analogy  with  the  proof  theory  and  alge- 
braic models  of  our  logics,  we  can  see  that  there  are  two  different  grades  of 
truth.  It  is  one  thing  for  a  formula  to  be  true  everywhere  in  a  model  —  this 
corresponds  to  being  entailed  by  the  Church  true  constant  T.  It  is  another 
thing  for  it  to  be  a  tautology,  for  it  to  be  entailed  by  the  Ackermann  true  con- 
stant t.  Identities  are  entailed  by  t.  What  corresponds  to  being  a  tautology 
in  this  sense  in  our  models?  Clearly  being  true  at  every  point  is  ruled  out,  as 
identities  can  fail  at  different  points  in  a  model.  Continuing  the  interpreta- 
tion of  points  as  bodies  of  information,  if  we  can  have  bodies  of  information 
which  do  not  warrant  all  of  the  tautologies  of  logic,  then  we  need  some  way 
of  talking  about  which  bodies  of  information  do.  The  simplest  approach  (and 
the  one  which  Urquhart  took)  is  to  take  a  special  body  of  information  0  to 
stand  for  "logic."  A  natural  condition  to  take  on  0  is  that  it  is  a  left  identity  for 
composition 

o    0  U  x  =  x     left  identity 

In  this  way,  0  lh  A  — >  B  if  and  only  if  for  each  x,  if  x  lh  A  then  x  lh  B  —  so  the 
conditionals  warranted  by  logic  correspond  to  exactly  the  entailments  valid 
in  the  frame.  The  identity  point  0  does  a  good  job  of  modelling  logic. 

The  interpretation  of  points  as  bodies  of  information  warrants  a  simple 
interpretation  of  conjunction  as  well.  The  usual  clause 

o  x  lh  A  A  B  if  and  only  if  x  lh  A  and  x  lh  B 
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is  uncontroversial.  If  a  body  of  information  warrants  A  A  B  it  warrants  A  and 
it  warrants  B,  and  conversely.  Adding  this  clause  to  the  semantics  gives  us  the 
conjunction  and  implication  fragment  of  R  (and  its  neighbours,  varying  the 
behaviour  of  of  U). 

Intensional  conjunction  is  also  straightforward.  We  can  add  fusion  as  the 
object-language  witness  of  composition: 

o  x  lh  A  o  B  if  and  only  if  for  some  y ,  z  where  x  =  y  U  z,  y  lh  A  and  z  lh  B. 

It  is  instructive  to  verify  that  in  these  models,  that  A  o  B  h  C  if  and  only  if 
A  h  B  — >  C.  Residuation  between  — >  and  o  corresponds  to  the  universal 
clause  modelling  — >  interacting  with  the  existential  clause  modelling  o. 

Let's  now  turn  to  soundness  and  completeness  with  respect  to  these  mod- 
els. To  prove  soundness  of  a  proof  theory  with  respect  to  these  models,  it  is  re- 
quired only  to  show  that  everything  provable  in  the  proof  theory  holds  in  the 
model  (either  holds  at  0  for  a  Hilbert  system,  or  holds  over  the  entire  frame  for 
a  proof  theory  which  delivers  consecutions) .  As  usual,  verifying  soundness  is 
a  straightforward  matter  of  checking  axioms  and  rules. 

There  are  two  different  ways  to  prove  the  completeness  of  a  proof  the- 
ory with  Urquhart's  operational  models.  Again,  as  usual,  the  common  tech- 
nique is  to  provide  a  counterexample  for  an  unprovable  formula  (or  conse- 
cution) .  Both  techniques  use  a  canonical  model  construction,  familiar  from 
the  worlds  semantics  for  modal  logics.  Where  these  constructions  differ  is  in 
the  stuff  out  of  which  the  points  in  the  model  are  made.  The  first,  and  most 
general  kind  of  canonical  model  we  can  provide  for  an  operational  semantics 
is  the  theory  model,  in  which  the  points  are  all  of  the  theories  of  the  logic  in 
question. 

Definition  42  (The  Theory  Canonical  Model)  The  set  of  points  is  the  set 
T  of  theories.  The  identity  theory  is  the  set  L  of  all  of  the  tautologies  of  the 
logic.  The  composition  relation  U  is  defined  as  follows: 

S  U  T  =  {B  :  (3A)  (A  ->  B  e  S  and  A  e  T)} 

and  S  lh  A  if  and  only  if  A  e  S. 

To  verify  that  the  theory  canonical  model  is  indeed  a  canonical  model  we 
must  show  that  U  so  defined  satisfies  all  of  the  conditions  of  a  composition 
relation,  and  that  lh  satisfies  the  recursive  conditions  of  an  evaluation  rela- 
tion. 

To  show  that  U  satisfies  the  conditions  of  composition,  you  need  first  show 
that  U  is  indeed  a  function  on  the  class  of  theories:  that  if  S  and  T  are  theories, 
so  is  S  U  T.  The  verification  of  this  fact  is  elementary.  The  frame  conditions  on 
U  correspond  quite  neatly  to  axioms  or  structural  rules. 

To  show  that  lh  satisfies  the  recursive  conditions,  you  need  show  that  A  A 
B  6  T  if  and  only  if  A,  B  e  T  (which  is  an  immediate  consequence  of  the 
definition  of  a  theory)  and  that  A  — >  B  e  S  if  and  only  if  for  each  T  where  A  e 
T,  B  e  S  U  T.  The  verification  from  left  to  right  is  an  immediate  consequence 
of  the  definition  of  U.  The  verification  from  right  to  left  is  simplest  to  prove 
in  the  contrapositive:  that  if  A  — >  B  ^  S  then  there  is  a  T  where  A  e  T  and 
B  £  S  U  T.  Finding  such  a  T  is  easy  here:  let  T  =  {C  :  A  h  C}.  If  B  e  S  U  T 
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then  there  is  some  C  e  T  where  C  — >  B  e  S.  Since  A  h  C,  it  follows  that 
C^BhA^B  (by  monotonicity  of  — >)  and  A^BeS  contrary  to  what  we 
have  assumed. 

It  is  possible  to  extend  this  kind  of  completeness  proof  to  show  that  the 
condition  for  fusion  models  this  connective  correctly  too. 

Definition  43  (The  Finite  Set  Canonical  Model)  The  points  are  the  finite 
sets  of  formulas.  Composition  U  is  set  union.  {Ai , . . . ,  An}  Ih  B  if  and  only  if 

hAi  ->  (  ►  (An-»B)) 

(The  permutation  axiom  shows  that  the  order  of  presentation  in  the  set  is 
irrelevant  in  this  definition.) 

It  is  not  difficult  to  show  that  this  is  indeed  a  model  —  that  the  recursive 
clause  defining  — >  is  satisfied. 

This  is  a  simple  model  which  gives  a  straightforward  counterexample  to 
any  invalid  argument.  If  A  1/  B  then  {A}  is  the  point  in  the  model  invalidating 
the  argument:  {A}  Ih  A  and  {A}  f  B. 

Operational  frames  are  important  models  of  other  substructural  logics  too. 

Example  44  (Language  frames)  A  language frame  on  alphabet  A  is  the  col- 
lection of  all  strings  on  that  alphabet,  with  U  defined  as  concatenation. 

Language  frames  are  a  model  of  the  Lambek  calculus.  The  composition  op- 
eration U  is  associative  but  not  commutative  (except  in  the  case  where  A  is  a 
singleton).  It  was  an  open  question  for  many  years  whether  or  not  the  Lam- 
bek calculus  is  complete  for  Language  frames.  Mati  Pentus  showed  that  it 
is,  using  an  ingenious  (and  difficult!)  model  construction  argument  pasting 
chains  of  partial  models  together  to  form  a  string  model  [210,  211].  Different 
frames  for  the  Lambek  calculus  feature  prominently  in  some  recent  work  on 
the  system  and  its  linguistic  applications  [200,  202]. 

Example  45  (Domain  Spaces)  Models  of  the  A-calculus  [1, 126, 251, 252]  are 
models  for  substructural  logics  too.  Scott's  famous  model  construction  in- 
volves a  topological  space  D  such  that  D  is  isomorphic  to  the  space  [D  — >  D] 
of  continuous  functions  from  D  to  itself.  Each  element  of  D  is  paired  with 
a  function  in  [D  — >  D],  so  can  think  of  the  objects  equally  well  as  functions. 
Therefore,  there  is  a  two -place  operation  of  application  on  the  domain.  Con- 
sider x(y )  —  the  application  of  x  to  y .  We  can  assign  types  to  functions  in  this 
model  by  "reading"  the  model  as  a  frame  for  a  logic.  If  we  setxU  y  to  bex(y), 
then  this  is  an  operational  frame: 

x  Ih  A  — >  B  if  and  only  if  for  eachy,  where  y  Ih  A,  x(y)  Ih  B. 

In  other  words,  x  is  of  type  A  — >  B  if  and  only  if  whenever  given  an  input  of 
type  x,  the  output  is  of  type  B.  This  gives  us  a  plausible  notion  of  function 
typing.  For  example,  Ax.(x  +  1 )  will  have  type  Even  — >  Odd  and  Odd  — >  Even. 
The  function  Ax.Ay .  (2x  +  y )  has  type  N  — >  ( Odd  — >  Odd)  (whatever  number  x 
is,  if  y  is  odd,  so  is  2x  +  y)  but  it  does  not  have  type  Odd  — >  [N  — >  Odd)  (if  y 
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is  even,  the  output  will  be  even,  not  odd) .  This  is  an  example  demonstrating 
the  failure  of  the  permutation-related  rule:  A  — >  (B  — >  C)  h  B  — >  (A  — >  B).84 

This  is  an  important  model  because  it  motivates  the  failure  of  not  only 
commutativity  of  U  but  also  associativity.  There  is  no  sense  in  which  x(y(z)) 
ought  be  equal  to  (x(y))(z).  Function  typing  models  a  very  weak  substruc- 
tural  logic. 

Urquhart  considered  adding  disjunction  to  operational  frames,  with  the  nat- 
ural clause: 

o  xlh  AVBifandonlyifxIh  Aorxlh  B 

However,  this  is  not  as  satisfactory  as  its  cousin  for  conjunction.  For  one 
thing,  R  models  extended  with  this  clause  validate  the  following  formula 

(A-)BVC)A(B-)C)^(A->C)  (1) 

which  is  not  valid  in  R.85  Secondly,  and  more  importantly,  the  interpretation 
in  terms  of  pieces  of  information  simply  doesn't  motivate  the  straightforward 
clause  for  disjunction.  Pieces  of  information  may  well  warrant  disjunctions 
without  warranting  either  disjunct.  To  interpret  disjunction  in  operational 
models  (and  to  get  a  logic  in  the  vicinity  of  R  or  any  of  the  other  logics  we 
are  interested  in)  you  can  do  one  of  two  things.  One  approach  is,  taken  by 
Ono  [205,  206],  Dosen  [68,  69]  and  Wansing  [284],  is  to  admit  some  kind  of 
closure  operator  on  the  frame:  A  V  B  is  true  not  only  at  points  where  A  is  true 
and  where  B  is  true,  but  also  at  some  more  points,  found  by  closing  the  orig- 
inal set  under  some  operation.  Doing  this  will  almost  invariably  invalidate 
distribution,  and  we  will  look  at  one  example  of  this  kind  of  semantics  in  the 
a  couple  of  section's  time,  when  we  come  to  phase  space  models  for  linear 
logic. 

A  related  method,  and  one  which  validates  distribution,  was  discovered 
by  Kit  Fine  [97,  99]  in  the  mid  1970s.  He  showed  that  if  you  have  a  two  tiered 
collection  of  points,  the  whole  class  S  with  a  special  subset  P  of  prime  points 
(in  analogy  with  prime  theories)  which  respect  disjunction.  For  points  in  P,  a 
disjunction  is  true  if  and  only  if  a  disjunct  is.  For  arbitrary  points  in  S  this  may 
fail.  For  an  arbitrary  point  in  S,  however,  you  have  a  guarantee  that  there  it 
can  be  covered  by  a  point  in  P.  For  each  s  e  S  there  is  at  least  one  s'eP  where 
s  c  s'.  This  means  that  disjunctions  are  at  least  promissory  notes:  although 
a  disjunct  may  not  be  true  given  this  body  of  information,  it  is  possible  for 
the  information  to  be  filled  out  so  that  you  get  one  or  other  disjunct.  Then,  a 
disjunction,  in  a  Fine  model,  is  evaluated  like  this: 

o  x  lh  A  V  B  if  and  only  if  for  eachy  e  T  where  x  C  y,  y  lh  A  ory  lh  B. 

Fine's  models  will  satisfy  distribution,  and  model  the  positive  fragment  of  R 
nicely.  They  do  so  at  the  cost  of  requiring  special  bodies  of  information,  those 

84We  can  use  other  connectives  to  expand  the  type  analysis  of  terms.  Conjunction  clearly 
makes  sense  in  this  interpretation:  x  lh  A  A  B  if  and  only  if  x  lh  A  and  x  lh  B.  In  this  way, 
we  have  models  not  only  for  typing  functions  with  — »  but  also  with  intersection.  These  are  mod- 
els for  the  Torino  type  system  An  [16,  60,  135,  276]. 

85This  is  not  to  say  that  the  operational  semantics  with  this  disjunction  clause  hasn't  been 
investigated.  See  some  interesting  papers  of  Charlewood:  [52,  53],  following  on  from  a  result  of 
Fine  [98]. 
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which  are  prime.  They  also  have  the  cost  of  requiring  a  new  notion  C,  of  infor- 
mational inclusion.  This  requires  a  new  condition  on  frames,  the  hereditary 
condition,  familiar  from  models  for  intuitionistic  logic: 

Definition  46  (Hereditary  Condition)  If  x  II-  p  andx  c  p  thenp  Ih  p  too. 

To  show  that  the  hereditary  condition  extends  from  atomic  propositions  to  all 
propositions,  a  further  model  condition  is  required  to  validate  the  inductive 
step  for  the  conditional.  You  need  to  assume  that 

If  x  C  x'  andp  C  p'  thenx  U  y  C  x'  Up'. 

Given  this  clause,  we  indeed  have  a  model  for  positive  R.  The  cost  has  been 
a  complication  of  the  clause  for  disjunction,  the  requirement  that  we  have  a 
two-tiered  universe  of  points,  and  a  hereditary  condition  on  points.  This  is 
not  the  only  way  to  model  the  whole  of  R.  Routley  and  Meyer,  independently 
of  Fine,  came  to  an  equally  powerful  semantics,  with  a  slightly  smaller  set  of 
primitive  notions.  Before  looking  at  the  Routley-Meyer  semantics  in  the  next 
section,  I  must  say  a  little  about  negation  in  the  operational  semantics. 

How  one  interprets  negation  depends  to  a  great  extent  on  the  intended 
interpretation.  The  Boolean  clause  for  negation 

o  x  Ih  -A  if  and  only  if  x  \f  A 

is  marvellously  appropriate  in  string  models  of  the  Lambek  calculus  (a  string 
is  of  type  "not  a  noun"  just  when  it  is  not  of  type  "noun")  and  in  function  typ- 
ing (a  function  like  Ax.x2  has  type  —Even  — >  —Even:  it  sends  inputs  which  are 
not  even  to  outputs  which  are  also  not  even)  but  it  is  terrible  when  it  comes 
to  taking  points  as  bodies  of  information.  There  is  little  reason  to  think  that 
a  body  of  information  x  warrants  the  negation  of  A  just  when  it  fails  to  war- 
rant A.  Bodies  of  information  can  be  incomplete  (warranting  neither  a  claim 
nor  its  negation)  and  they  can  be  inconsistent  (warranting  —  you  might  think 
misleadingly  —  a  claim  and  its  negation) .  Something  else  too  must  be  done 
to  model  negation.  Fine  had  a  treatment  of  negation  in  his  models,  but  it  too 
appeals  essentially  to  the  two-tiered  nature  of  a  model,  and  it  is  simpler  in  the 
Routley-Meyer  incarnation. 

3.3.2   Routley-Meyer  Frames 

Roudey  and  Meyer  [239,  240,  241,  242]  chose  to  keep  the  interpretation  for 
disjunction  simple,  and  to  generalise  the  interpretation  for  implication.  The 
central  feature  of  a  Routley-Meyer  frame  is  the  ternary  relation  R.  The  clause 
for  implication  then  is: 

o  x  Ih  A  — >  B  if  and  only  if  for  each  p ,  z  where  Rxpz,  ifp  Ih  A  then  z  Ih  B. 

This  is  a  generalisation  of  the  operational  semantics.  An  operational  frame  is 
a  Routley-Meyer  frame  where  Rxpz  holds  if  and  only  if  x  U  p  =  z.  The  inter- 
pretation of  R  is  similarly  a  generalisation  of  that  for  U.  Reading  the  implica- 
tion clause  "in  reverse"  (as  assigning  meaning  to  R  and  not  to  -^)86  we  have 
that  Rxpz  if  and  only  if  the  laws  (or  conditionals)  in  x,  applied  to  the  facts 

86Which,  frankly,  is  exactly  what  is  done  in  cases  of  interpreting  the  accessibility  relation  in  a 
modal  logic  as  "relative  possibility". 
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(antecedents)  in  y  give  outcomes  (consequents)  true  in  z.  Or  more  shortly, 
applying  x  to  y  gives  an  outcome  included  in  z.  That  this  is  a  genuine  rela- 
tion means  that  applying  x  to  y  might  give  no  outcome  at  all.  On  the  other 
hand,  it  we  might  have  Rxyz  and  Rxyz'  for  different  z  and  z'.  The  result  of 
applying  x  and  y  is  no  doubt  a  body  of  information,  but  it  might  not  be  a 
prime  body  of  information.  For  example  we  might  have  x  lh  A  — >  B  V  C  and 
y  lh  A.  Applying  the  information  in  x  to  that  in  y  will  give  B  V  C,  without 
giving  us  any  guidance  on  which  of  B  or  C  it  is  to  be.  And  this  is  possible 
even  if  x  is  prime  —  for  in  R  we  don't  have  the  counterintuitive  entailment 
A  — >  B  V  C  h  (A  — >  B)  V  (A  — >  C),  so  we  have  no  reason  to  think  that  x  might 
contain  either  A  — >  B  or  A  — >  C.  So,  in  this  case,  we'd  have  two  points  z  and 
z'  where  Rxyz  and  Rxyz'.  At  z  we  can  have  B  and  at  z'  we  can  have  C.  In  this 
way,  we  verify  A  — >  B  V  C  at  x,  all  the  time  using  prime  points.  We  only  have 

a  semantics  for  the  positive  part  of  R  when  endow  the  ternary  relation  R  with 
some  properties.  Routley  and  Meyer's  original  properties  are  best  stated  with 
the  use  of  some  shorthand. 

o  "R2abcd"  is  shorthand  for  (3x)(Rabx  A  Rxcd). 

o  Given  the  distinguished  point  0,  we  let  "a  C  b"  be  shorthand  for  ROab. 

These  abbreviations  make  sense,  given  the  interpretations  of  the  concepts  at 
hand.  R2abcd  conjoins  application.  You  apply  a  to  b  and  get  a  result  in  x  (for 
some  x)  which  we  then  apply  to  c  to  get  a  result  in  d.  One  way  of  thinking  of 
this  is  applying  a  to  b  and  applying  all  of  this  to  c.  The  inclusion  relation  is 
defined  by  looking  at  what  happens  when  you  apply  logic  to  a  state.  Applying 
logic  to  a  ought  result  in  nothing  more  than  a.  So,  if  ROab  if  and  only  if  a  is 
included  in  b. 

Fine  has  suggested  [97]  writing  Rabc  as  "b  ca  c",  and  reading  it  as:  ac- 
cording to  a,  b  is  contained  in  c.87  In  this  case,  C  is  C0,  containment  from  the 
point  of  view  of  logic. 

Here  are  the  postulates  Routley  and  Meyer  gave  to  make  their  semantics 
model  the  logic  R. 

o  (Identity)  ROaa  for  each  a. 

o  (Commutativity)  If  Rabc  then  Rbac. 

o  (Pasch's  Postulate)  If  R2 abed  then  R2acbd. 

o  (Idempotence)  Raaa  for  each  a. 

o  (Heredity)  If  Rabc  and  a'  C  a  then  Ra'bc. 

These  postulates  parallel  the  postulates  for  U.  Identity  and  heredity  govern 
the  behaviour  of  0,  making  it  fit  to  do  the  job  of  t,  and  to  be  a  place  to  witness 
logical  truths.  Commutativity  corresponds  to  the  commutativity  of  fusion, 
Pasch's  postulate  corresponds  to  B':  an  equivalent  postulate,  given  commu- 
tativity, would  be 

o  (Associativity)  If  R2  abed  then  R2a(bc)d. 

Where  "R2a(bc)d"  is  read  as  (3x)(Rbcx  A  Raxd). 

87This  is  a  plausible  proposal,  provided  that  you  are  aware  that  in  models  for  R,  jZ  a  may  fail  to 
be  reflexive,  as  is  needed  to  forma  counterexample  to  A  h  B  — >  B. 
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Idempotence  does  the  job  of  Wl.  So,  we  have  a  match  with  the  postulates 
for  an  operational  frame.  And  as  with  operational  frames,  ringing  the  changes 
with  regard  to  the  behaviour  of  R  will  result  in  different  logical  systems. 

Soundness  of  Routley-Meyer  models  is  a  straightforward  matter  of  show- 
ing that  each  provable  consecution  is  valid  on  each  model.  (A  valid  consecu- 
tion is,  as  usual,  one  which  is  preserved  at  every  point.)  To  interpret  conse- 
cutions, you  must  have  an  interpretation  of  fusion,  but  that  is  as  you  would 
expect. 

o  x  lh  A  o  B  iff  there  are  y,z  where  Ryzx,  y  lh  A  and  z  lh  B. 

A  fusion  is  true  at  a  point  in  a  model  when  it  is  the  composition  of  two  points, 
at  which  the  "fusejuncts"  are,  respectively,  true.  Logically  true  formulas  are 
then  always  true  at  0  in  a  Routley-Meyer  model. 

Demonstrating  completeness,  as  always  for  a  semantics  like  this,  is  much 
more  involved.  As  usual,  it  is  a  canonical  model  construction.  To  construct 
a  canonical  model  for  a  logic  like  R,  instead  of  dealing  with  all  theories,  as 
we  could  with  operational  models,  we  must  deal  in  prime  theories68  But  here, 
not  just  any  prime  theories  will  do.  In  these  models,  each  point  is  closed  un- 
der consequence  as  defined  at  the  point  0.  This  is  a  fundamental  fact  about 
Routley-Meyer  models: 

Fact  47  (Semantic  Entailment)  A  entailsB  in  a  Routley-Meyer  model  (that 
is,  for  all  x,  ifx  lh  A  thenx  lh  B)  if  and  only  if  0  lh  A  — >  B. 

This  means  that  these  points  are  not  only  prime  theories,  they  are  prime  0- 
theories.  They  are  closed  under  the  "logic  of  0."  And  here,  what  is  going  on  at 
0  may  be  more  than  just  genuinely  "logic"  according  to  the  logic  in  question. 
In  particular,  this  is  the  case  at  R,  at  least  if  negation  is  around.  (Bear  with 
the  fact  that  I  haven't  told  you  how  to  interpret  negation  yet.)  For  R  proves 
A  V  ~A,  and  so,  by  the  primeness  of  the  point  0,  we  will  have  either  0  lh  A  or 
0  lh  ~A.  In  any  particular  model,  0  will  validate  more  than  "logic  alone".  So,  to 
construct  a  model,  we  will  first  construct  a  prime  theory  T  for  the  base  point 
0,  and  then  the  other  points  in  the  model  will  be  prime  T-theories:  theories 
closed  under  the  inferences  licensed  by  T. 

Definition  48  (The  Prime  Theory  Canonical  Model  on  T)  Given  a  prime, 
regular  theory  T,  prime  theory  canonical  model  on  T  is  populated  by  prime  T- 
theories.  The  identity  point  is  T  itself.  The  ternary  relation  R  is  defined  as 
follows: 

RUVWif  and  only  if{B  :  (3A)(A  ->  B  e  U  and  A  e  V)}  C  W 
and  U  lh  A  if  and  only  if  A  e  U. 

Note  the  similarity  of  the  definition  of  R  here  to  the  definition  of  U  on  the 
canonical  theory  model  on  page  71.  Here,  R  is  defined  by  composition  of  the- 
ories, but  the  composition  of  two  prime  T-theories  may  not  itself  be  a  prime 
theory,  so  we  resort  to  the  ternary  relation. 

88Deflned  at  Definition  5  on  page  24. 
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Proving  that  this  is  indeed  a  model  is  a  matter  of  checking  all  of  the  clauses. 
The  difficult  conditions  are  the  existential  ones,  according  to  which  there  is 
a  point  in  the  model  with  certain  properties.  An  example  is  one  half  of  the 
implication  clause:  if  A  — >  B  ^  U,  we  need  to  find  V,  W  where  RUVW,  A  e  V 
and  B  ^  W.  This  is  a  matter  of  using  Belnap's  Pair  Extension  Theorem89  twice. 
First,  to  construct  V  you  use  the  pair  ({A},  {C  :  C  — >  B  e  U}),  and  extend  it  to 
get  a  prime  V.  Then,  for  W  you  use  the  pair  (U  U  V,{B}).  The  result  will  be 
the  two  prime  theories  you  wish.  The  same  techniques  work  for  the  other 
difficult  clauses.  The  canonical  prime  theory  model  is  indeed  a  model. 

This  shows  the  ubiquity  of  the  pair  extension  theorem  in  the  metatheory 
of  distributive  substructural  logics.  Prime  theories  play  the  part  here  of  con- 
sistent and  complete  theories  in  the  metatheory  of  classical  intensional  log- 
ics. 

I  have  said  nothing  about  the  treatment  of  negation.  Routley's  innovation 
is  to  understand  that  negation  can  be  modelled  by  another  operator  which 
takes  us  away  from  the  current  point  of  evaluation.  The  Boolean  clause  will 
not  do.  The  alternative  is  this: 

o  x  lh  ~A  if  and  only  if  x*  \f  A 

where  *  is  a  map  of  period  two  on  the  set  of  points  in  a  model.  That  is,  x**  =  x. 
This  indeed  suffices  to  make  ~  a  de  Morgan  negation  on  the  model.  Adding 
the  following  condition 

o  (Contraposition)  If  Rxijz  then  Rxz*y* 

results  in  the  model  validating  the  contraposition  axiom. 

There  has  been  a  great  deal  of  debate  centred  around  the  interpretation 
of  the  *  operator  [58,  59,  186]. 90  There  is  no  doubt  that  the  *  operation  is 
not  particularly  perspicuous  in  and  of  itself.  (Being  told  that  it  turns  set-ups 
"inside  out"  is  not  particularly  enlightening.)  Instead  of  pursuing  that  debate 
here  (which  largely  burned  out),  I  will  merely  quote  an  insight  from  Belnap 
and  Dunn: 

...  we  are  convinced  of  the  high  probability  that  a  mathemati- 
cal apparatus  of  such  proven  formal  power  will  eventually  find  its 
concrete  applications  and  its  resting  place  in  intuition  (think  of 
tensors).  [11,  page  164] 

This,  I  think,  has  been  borne  out  in  the  later  development  of  the  Routley- 
Meyer  semantics,  and  its  applications.  But  to  find  a  plausible  interpretation 
of  * ,  and  to  understand  the  semantics  more  fully,  we  need  to  work  with  it 
some  more.  As  it  stands  so  far,  the  Routley-Meyer  construction  might  seem 
ad  hoc  and  fit  simply  for  R  and  its  neighbours.  For  although  you  can  ring 
the  changes  with  some  of  the  rules  (commutativity,  associativity,  contraction) 
others  seem  hopelessly  fixed.  The  models,  as  they  stand,  do  not  seem  natural 
in  the  way  that  Kripke  models  for  modal  logic  do. 

This  is  merely  an  appearance.  Recent  work  (dating  from  the  1990s,  and 
chiefly  due  to  Dunn,  on  gaggle  theory)  has  shown  that  ternary  frames  are 

89  Fact  10  on  25. 

90Not  to  be  confused  with  the  *  of  display  logic,  which  simply  means  "not"  in  the  metatheory 
of  structures. 
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completely  natural  models  for  substructural  logics  in  just  the  same  way  as 
Kripke  models  interpret  normal  modal  logics  [85,  86,  87,  88,  234]. 

Definition  49  (Ternary  Frames)  A  ternary  frame  is  a  set  with  a  ternary  re- 
lation R  on  that  set.  The  connectives  A,  V,  T,  _L,  — >,  o,  <—  can  be  defined  on  a 
ternary  frame  as  follows: 

o  x  lh  A  A  B  if  and  only  if  x  lh  A  and  x  lh  B 

o  xlh  AVBifandonlyifxIh  Aorxlh  B 

o  x  lh  _L  never 

o  x  lh  T  always 

o  x  lh  A  — >  B  if  and  only  if  for  each  y,  z  where  Rxyz,  if  y  lh  A  then  z  lh  B. 
o  x  lh  A  <—  B  if  and  only  if  for  each  y,  z  where  Ryxz,  if  y  lh  A  then  z  lh  B. 
o  x  lh  A  o  B  iff  there  are  y,z  where  Ryzx,  y  lh  A  and  z  lh  B. 
Many  structural  rules  come  with  a  corresponding  conditions  on  R.91 

There  are  no  restrictions  on  R  in  such  an  interpretation.  It  models  distributive 
lattice  operations,  together  with  the  triple  (— >,  o,  <— ).  A  soundness  and  com- 
pleteness result,  using  standard  techniques,  works  for  this  frame  semantics. 

Interpreting  R  is  a  tricky  business,  as  we  have  seen.  Probably  no  non- 
circular  definition  (one  which  doesn't  appeal  to  conditionality)  will  be  pos- 
sible. However,  some  interesting  explications  of  R  have  been  tried  in  the  ap- 
plied semantics  of  relevant  logics.  One  answer  which  has  some  cachet  at 
present  explains  R  in  terms  of  situation  theory.  If  the  points  in  a  model  are  cir- 
cumstances or  situations  of  some  kind,  then  R  indicates  the  degree  to  which 
situations  can  carry  information  about  other  situations.  In  particular,  Rabc 
holds  just  when  circumstance  a  acts  as  a  information  channel  from  b  to  c. 
There  is  a  significant  growing  recent  literature  on  the  connections  between 
traditional  situation  theory  and  the  semantics  of  relevant  logics  [21,  18,  161, 
228,231]. 

Some  conditions  (such  as  the  condition  for  double  negation  elimination, 
which  is  too  complex  to  discuss  here  [235])  require  talk  of  a  relation  of  in- 
clusion between  points  in  models.  This  is  not  surprising,  if  in  the  intended 
interpretation,  points  are  possibly  incomplete  (think  of  models  for  intuition- 
istic  logic)  then  sometimes  the  relation  of  extension  or  inclusion  might  play  a 
role. 

Definition  50  (Inclusion)  c  is  an  inclusion  relation  on  a  ternary  frame  if 
and  only  if  it  is  reflexive,  transitive  and  asymmetric,  and  in  addition 

o  For  allx,i),zif  Rxyz  andx'  Cx.y'Cij  andz  C  z'  then  Rx'y'z'.92 
A  model  on  a  ternary  frame  with  an  inclusion  relation  must  satisfy  the  hered- 
itary condition  on  atomic  formulas 

o  If  x  lh  p  andx  c  x' then x' lh  p 

The  clause  linking  C  and  R  suffices  to  prove  the  hereditary  lemma:  complex 
formulas  involving  — >,  o,  <—  satisfy  the  hereditary  lemma  if  their  constituents 
do. 

91Some,  however,  require  an  inclusion  relation  jZ,  to  be  denned  below. 

92This  is  quite  a  defensible  condition  as  it  stands,  but  it's  more  general  than  it  needs  to  be  to 
prove  the  hereditary  lemma  for  all  formulas  [234,  Chapter  11]. 
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Inclusion  as  a  relation  between  points  in  a  model,  is  usually  simple  to  ex- 
plain given  an  interpretation  of  these  points.  If  points  are  situations,  then 
a  c  b  just  when  a  is  a  substitution  of  b.  The  situation  of  my  bedroom  is  a 
substitution  of  the  situation  of  my  house.  An  inconsistent  circumstance  de- 
scribed by  the  first  chapter  of  some  fiction  may  well  be  a  substitution  of  an 
inconsistent  circumstance  described  by  the  whole  book.  Given  these  explica- 
tions of  inclusion,  the  connection  between  it  and  R  is  plausible.  As  x  shrinks 
to  x',  it  connects  more  pairs  of  circumstances,  as  for  a  given  antecedent  cir- 
cumstance there  are  more  possible  consequent  circumstances.  Given  x,  as  y 
shrinks  to  y ',  again,  there  are  more  possible  consequent  circumstances,  as 
y '  gives  us  less  information  to  constrain  possible  consequents.  These  expli- 
cations are  probably  not  reductions  of  the  notion,  but  they  go  some  way  to 
explain  their  appeal  and  their  use. 

Another  significant  role  in  models  is  played  by  the  distinguished  point  0 
in  Routley-Meyer  models.  This  point  plays  the  part  of  modelling  t, 

Definition  51  (Truth  Set)  T  is  a  truthset  in  a  ternary  model  with  inclusion 
if  and  only  if. 

o  RTab  if  and  only  if  a  C  b. 
where  "RTab"  stands  for  (3x)(x  e  TARxab).  Atruthsetis  reduced  if  it  has  the 
form  (x:0Cx)  for  some  point  0. 

A  truth  set  does  the  job  of  recording  frame  consequence.  The  — >  formulas  true 
at  every  point  in  T  are  exactly  the  entailments  witnessed  by  the  entire  model. 
For  some  applications  (in  particular,  using  frames  to  prove  the  admissibility 
of  disjunctive  syllogism  [239])  reduced  truth  sets  are  desirable.  A  great  deal  of 
work  has  gone  in  to  showing  the  circumstances  in  which  a  logic  has  a  seman- 
tics with  a  reduced  truth  set  [255,  220,  227,  230].  On  the  other  hand,  in  our 
intended  interpretation,  it  is  by  no  means  obvious. 

Most  contentious  is  the  interpretation  of  negation.  Some  of  Dunn's  recent 
work,  however,  has  served  to  take  the  sting  out  of  *  [87,  233] .  Dunn  notes  that 
*  is  a  particular  case  of  a  more  understandable  clause  for  negation:93 

Definition  52  (Compatibility)  A  compatibility  relation  C  on  a  frame  is  an 
arbitrary  two-place  relation.  Negation  is  interpreted  using  C  as  follows: 

o  x  lh  ~A  if  and  only  if  for  all  y ,  if  xCy  then  y  \f  A. 
If  the  frame  uses  an  inclusion  relation,  compatibility  is  related  to  inclusion  as 
follows: 

o  IfxCy,  x'  C  xandy'  Cy  thenx'Cy'. 

If  you  think  of  x  and  y  as  compatible  just  when  there  are  no  clashes  between 
them,  then  these  clauses  are  defensible.  A  circumstance  warrants  ~A  just 
when  there's  no  compatible  circumstance  in  which  A  holds.  So,  ~A's  hold- 
ing in  a  circumstance  just  when  A  is  ruled  out  by  that  circumstance:  there 
is  no  compatible  circumstance  in  which  A.  If  circumstances  are  possibly  in- 
complete (they  might  be  compatible  with  more  than  just  themselves)  and  if 
they  are  possibly  inconsistent  (not  compatible  with  themselves:  it  contains 

93The  expression  in  terms  of  compatibility  is  mine.  Dunn  uses  a  relation  of  incompatibility, 
expressed  the  symbol  "±",  which  is  already  overloaded  here. 
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an  internal  contradiction)  then  we  have  counterexamples  to  the  paradoxes  of 
implication  from  before.  We  may  have  x  Ih  A  A  ~A  (when  it  is  not  the  case 
that  xCx)  or  we  may  have  x  1/  A  V  ~A  (when  x  1/  A,  but  xCy  where  y  Ih  A). 

But  what  of  the  dreaded  *?  It  can  be  seen  to  be  a  special  case  of  C.  The 
behaviour  of  C  is  wrapped  up  by  *  just  if  a*  is  the  unique  C -maximum  of 
the  set  {x  :  qCx}  of  all  points  compatible  with  a.  If  this  set  has  a  unique  C- 
maximum,  a*,  then  indeed  a  Ih  -A  just  when  a*  \f  A  [233]. 

So,  the  ternary  frame  semantics  can  be  "deconstructed"  into  individual 
components,  each  of  which  may  be  explained  and  applied  in  different  cir- 
cumstances. Here  are  some  other  examples  of  ternary  frames  which  have 
been  useful  in  the  study  of  substructural  logics. 

Example  53  (Two-Dimensional  Frames)  Given  a  set  D,  we  can  define  a 
frame  on  the  set  D  x  D  of  pairs  of  D  elements,  by  defining  the  ternary  re- 
lation R  on  D  x  D,  setting  R(a,  b)  (c,  d)  (e,  f)  if  and  only  if  b  =  c,  e  =  a  and 
b  =  f.  In  other  words,  (a,  b)  composes  with  (b,  c)  to  result  in  (a,  c),  and  no 
other  relations  hold  between  pairs.  So,  the  evaluation  conditions  on  these 
two-dimensional  frames  reduce  as  follows: 

o  (a,  b)  Ih  A  o  B  if  and  only  if  for  some  c  e  D,  (a,  c)  Ih  A  and  (c,  b)  Ih  B. 

o  (a,  b)  Ih  A  — >  B  if  and  only  if  for  each  c  e  D  if  (b,  c)  Ih  A  then  (a,  c)  Ih  B. 

o  (a,  b)  Ih  B  <-  A  if  and  only  if  for  each  c  e  D  if  (c,  a)  Ih  A  then  (c,  b)  Ih  B. 
In  this  frame,  the  ternary  R  reduces  to  a  partial  function  on  pairs.  This  func- 
tion is  associative  but  not  symmetric,  where  defined.  The  point  set  is  flat  — 
there  is  no  natural  notion  of  inclusion  to  be  imposed.  This  frame  has  a  truth 
set,  but  in  this  case  it  is  not  reduced:  it  is  the  set  {(a,  a)  :  a  e  D}. 

These  models  are  studied  by  van  Benthem,  Dosen  and  Orlowska  [37,  70, 
208]  in  the  context  of  substructural  logics,  and  they  have  blossomed  into  their 
own  industry,  under  the  suggestive  name  'arrow  logics'  [167].  In  these  logics, 
we  think  of  the  points  (a,  b)  as  transitions,  or  arrows,  from  a  to  b. 

These  are  important  frames  for  they  are  closely  related  to  language  frames 
in  a  number  of  respects  —  the  relation  R  is  functional:  if  Rxyz  and  Rxyz'  then 
z  =  z'.  However,  in  this  case  the  relation  is  partial.  For  some  x,  y  there  is  no  z 
such  that  Rxyz. 

Example  54  (Mitchell's  IE  models)  Mitchell's  IE  models,  are  models  of  lin- 
ear logic  with  distribution  of  A  over  V  [199].  In  these  models,  points  are 
pairs  (m,  n)  whose  elements  are  taken  from  a  commutative  monoid  R  of  re- 
sources. As  R  is  a  commutative  monoid,  there  is  an  operation  +  on  R,  such 
that  m  +  n  =  n  +  m,  with  an  identity  0,  such  that  m  +  0  =  m  =  0  +  m.  We 
evaluate  propositions  at  points  as  follows:  (m,n)  Ih  A  o  B  if  and  only  if  for 
some  ni,n2  where  n  =  m  +  H2,  (m  +  ni  ,ri2)  Ih  A  and  (m  +  n2,ni)  Ih  B. 
(m,  n)  Ih  ~A  if  and  only  if  (n,  m)  \f  A.  (m,  n)  Ih  A  — >  B  if  and  only  if  for  each 
mi , m.2  where  m  =  mi  +  m.2,  if  (n  +  m.2, mi)  Ih  A  then  (m.2,n  +  mi)  Ih  B. 
Conjunction,  disjunction,  T  and  _L  are  defined  in  the  usual  way. 

Early  antecedents  of  the  frame  semantics  for  substructural  logics  can  be 
found  in  Jonnson  and  Tarski's  work  on  the  representation  of  Boolean  alge- 
bras with  operators  [141,  142].  This  work  presents  what  amounts  to  a  sound- 
ness and  completeness  result  for  frames  of  substructural  logics  (with  Boolean 
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negation),  though  it  takes  a  certain  amount  of  hindsight  to  see  it  as  such.  The 
papers  are  written  very  much  from  the  perspective  of  algebra  and  represen- 
tation theory. 

An  extensive  study  of  the  properties  of  ternary  frames  is  given  in  Routley 
Meyer,  Brady  and  Plumwood's  Relevant  Logics  and  their  Rivals  [242].  Gab- 
bay  [105]  also  gave  a  ternary  relational  semantics  for  implication,  indepen- 
dently of  the  tradition  of  Routley  and  Meyer.  Frames  can  be  viewed  from  an 
algebraic  point  of  view.  The  class  of  propositions  of  the  frame  is  a  completely 
distributive  lattice  under  intersection  and  union,  and  it  is  equipped  with  the 
appropriate  operators,  defined  by  the  clauses  in  the  evaluation  conditions. 
For  example,  the  implication  clause  gives  us 

a— >  (3={x:  (Vy,ze  T)(Rxyz  — >  (y  e  a  — >  z  e  (3))} 

Similarly,  -a  =  {x  :  (Vy  e  T)  (xCy  — >  y  ^  a)},  and  so  on.  We  will  call  the  result- 
ing propositional  structure  'Alg(.F)'  the  algebra  of  the  frame  T.  Furthermore, 
any  interpretation  Ih  on  a  frame  gives  you  an  evaluation  vn-  given  by  setting 
vih(A)  to  be  [A].  The  connections  with  algebra  run  even  deeper.  Our  canon- 
ical model  is  constructed  out  of  prime  theories  in  a  language.  A  similar  con- 
struction can  work  with  the  prime  filters  of  a  propositional  structure.  Dunn's 
work  on  gaggles  [85,  86,  87,  88]  generalises  the  results  here  to  operators  with 
arbitrary  arity.  An  n-ary  operator  is  modelled  with  an  n  +  1  -place  relation. 

Duality  theory  is  the  study  of  the  relationship  between  algebras  and  their 
representations  in  terms  of  frames.  There  is  an  important  strand  of  recent 
work  in  the  semantics  of  substructural  logic  exploring  duality  theory  in  this 
context  [129,  130,  132,  247,  274]. 

Meyer  and  Mares  have  important  work  on  the  particular  case  of  adding  an 
S4-type  necessity  for  R  [164],  and  they  have  shown  that  disjunctive  syllogism 
is  admissible  in  this  case,  using  the  frame  semantics  to  prove  it.  Meyer  and 
Mares  have  also  studied  the  extensions  of  these  logics  with  Boolean  nega- 
tion [163,  185]. 

Study  of  the  frame  conditions  corresponding  to  rules  brings  forward  ques- 
tions of  canonicity  and  correspondence.  When  is  the  canonical  frame  for  a 
logic  itself  a  model  of  the  logic?  This  is  not  always  the  case  in  modal  logics, 
and  also,  not  always  the  case  in  substructural  logics.  There  has  been  some 
work  in  attempting  to  pin  down  the  class  of  substructural  logics  for  which 
canonicity  holds  [113,  148,  234]. 

Not  all  logics  have  connectives  which  are  amenable  to  the  treatment  of 
accessibility  relations.  We  will  see  this  when  we  consider  !  from  linear  logic. 
Another  case  is  the  counterfactual  conditional.  These  are  more  aptly  mod- 
elled by  neighbourhood  frames.  There  has  been  a  little  work  considering  how 
neighbourhood  frames  can  be  used  in  a  substructural  setting  [5, 104,  162]. 

3.3.3   Projective  Frames,  and  Undecidability 

R  is  undecidable.  Alasdair  Urquhart  proved  this  in  his  ground  breaking  pa- 
pers [269,  270].  The  general  idea  is  a  straightforward  one:  encode  a  known 
undecidable  problem  into  the  language  of  R.  Meyer  showed  how  to  do  this 
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in  the  1960s,  by  constructing  a  simple  substructural  logic,  such  that  deciding 
what  was  a  theorem  in  a  that  logic  would  enable  you  to  solve  the  word  prob- 
lem for  free  semigroups  [175,  193].  That  logic  was  not  particularly  natural. 
(It  was  the  Lambek  calculus  together  with  just  enough  contraction  to  enable 
you  to  represent  the  deducibility  problem  as  a  conditional.)  The  logic  was 
not  particularly  like  R.  The  insights  that  helped  decide  the  issue  for  R  came 
from  an  unexpected  quarter  —  projective  geometry.  To  see  why  projective 
geometry  gave  the  necessary  insights,  we  will  first  consider  a  simple  case,  the 
undecidability  of  the  system  KR.  KR  is  given  by  adding  A  A  ~A  — >  B  to  R.  A  KR 
frame  is  one  satisfying  the  following  conditions  (given  by  adding  the  clause 
that  a  =  a*  to  the  conditions  for  an  R  frame).94 

ROab  iff  a  =  b       Rabc  iff  Rbac  iff  Racb  (total  permutation) 
Raaa  for  each  a    R2  abed  only  if  R2acbd 

The  clauses  for  the  connectives  are  standard,  with  the  proviso  that  a  lh  ~A  iff 
a  \y  A,  since  a  =  a* . 

Urquhart's  first  important  insight  was  that  KR  frames  are  surprisingly  sim- 
ilar to  projective  spaces.  A  projective  space  Pisa  set  P  of  points,  and  a  collec- 
tion L  of  subsets  of  P  called  lines,  such  that  any  two  distinct  points  are  on  ex- 
actly one  line,  and  any  two  distinct  lines  intersect  in  exactly  one  point.  But  we 
can  define  projective  spaces  instead  through  the  ternary  relation  of  collinear- 
ity.  Given  a  projective  space  V,  its  collinearity  relation  C  is  a  ternary  relation 
satisfying  the  condition: 

Cabc  iff  a  =  b  =  c,  or  a,  b  and  c  are  distinct  and  they  lie  on  a 
common  line. 

If  V  is  a  projective  space,  then  its  collinearity  relation  C  satisfies  the  following 
conditions, 

Caaa  for  each  a.  Cabc  iff  Cbac  iff  Cacb.  C2  abed  only  if  C2acbd. 

provided  that  every  line  has  at  least  four  points  (this  last  requirement  is  nec- 
essary to  verify  the  last  condition).  Conversely,  if  we  have  a  set  with  a  ternary 
relation  C  satisfying  these  conditions,  then  the  space  defined  with  the  origi- 
nal set  as  points  and  the  sets  lab  =  {c  :  Cabc}  U  {a,  b}  where  a^bas  lines  is  a 
projective  space. 

Now  the  similarity  with  KR  frames  becomes  obvious.  If  V  is  a  projective 
space,  the  frame  T{V)  generated  by  P  is  given  by  adjoining  a  new  point  0, 
adding  the  conditions  COaa,  CaOa,  and  CaaO,  and  by  taking  the  extended 
relation  C  to  be  the  accessibility  relation  of  the  frame. 

Projective  spaces  have  a  naturally  associated  undecidable  problem.  The 
problem  arises  when  considering  the  linear  subspaces  of  projective  spaces. 
A  subspace  of  a  projective  space  is  a  subset  which  is  also  a  projective  space 
under  its  inherited  collinearity  relation.  Given  any  two  linear  subspaces  X 
and  y,  the  subspace  X  +  y  is  the  set  of  all  points  on  lines  through  points  in  X 
and  points  in  y. 

In  KR  frames  there  are  propositions  which  play  the  role  of  linear  subspaces 
in  projective  spaces.  We  need  a  convention  to  deal  with  the  extra  point  0,  and 
we  simply  decree  that  0  should  be  in  every  "subspace."  Then  linear  subspaces 

94My  presentation  of  these  results  is  indebted  to  many  discussions  with  Pragati  Jain  [140] . 
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are  equivalent  to  the  positive  idempotents  in  a  frame.  That  is,  they  are  the 
propositions  X  which  are  positive  (so  0  e  X)  and  idempotent  (so  X  =  X  o  X). 
Clearly  for  any  formula  A  and  any  KR  model  M ,  the  extension  of  A,  1 1  A|  |  in  M 
is  a  positive  idempotent  iff  0  Ih  A  A  (A  <->  A  o  A).  It  is  then  not  too  difficult 
to  show  that  if  A  and  B  are  positive  idempotents,  so  are  A  o  B  and  A  A  B,  and 
that  t  and  T  are  positive  idempotents. 

Given  a  projective  space  V,  the  lattice  algebra  (£,  n,  +)  of  all  linear  sub- 
spaces  of  the  projective  space,  under  intersection  and  +  is  a  modular  geo- 
metric lattice.  That  is,  it  is  a  complete  lattice,  satisfying  these  conditions: 

Modularity  a  >  c      (Vb)  (a  n  (b  +  c)  <  (a  n  b)  +  c) 

Geometricity  Every  lattice  element  is  a  join  of  atoms,  and  if  a  is  an  atom  and 
X  is  a  set  where  a  <  IX  then  there's  some  finite  Y  C  X,  where  a  <  IY. 

The  lattice  of  linear  subspaces  of  a  projective  space  satisfies  these  conditions, 
and  that  in  fact,  any  modular  geometric  lattice  is  isomorphic  to  the  lattice  of 
linear  subspaces  of  some  projective  space.  Furthermore  the  lattice  of  positive 
idempotents  of  any  KR  frame  is  also  a  modular  geometric  lattice. 

The  undecidable  problem  which  Urquhart  uses  to  prove  the  undecidabil- 
ity  of  KR  is  now  simple  to  state.  Hutchinson  [138]  and  Lipshitz  [157]  proved 
that 

Fact  55  (Modular  Lattice  Word  Problem)  The  word  problem  for  a  class 
of  modular  lattices  which  includes  the  subspace  lattice  of  an  infinite  dimen- 
sional projective  space  is  undecidable. 

Given  an  infinite  dimensional  projective  space  in  which  every  line  includes  at 
least  four  points  V,  the  logic  of  the  frame  [V]  is  said  to  be  a  strong  logic.  Our 
undecidability  theorem  then  goes  like  this: 

Fact  56  (Undecidability  for  KR)  Any  logic  between  KR  and  a  strong  logic 
is  undecidable. 

Proof  Consider  a  modular  lattice  problem 

If  vi  =  wi  . . .  vn  =  wn  then  v  =  w 

stated  in  a  language  with  variables  xi  (i  =  1,2,...)  constants  1  and  0,  and 
the  lattice  connectives  n  and  +.  Fix  a  map  into  the  language  of  KR  by  setting 
x\  —  pi  for  variables,  0*  =  t,  1*  =  T,  (vn  w)*  =  vt  A  w'  and  (v  +  w)1  =  v*  ow', 
The  translation  of  our  modular  lattice  problem  is  then  the  KR  formula 

(B  A  (vf  «-»  w])  A  ■  ■  ■  A  (v*  «-»  w*  )  At)  — >  (v*  <-»  w*] 

where  the  formula  B  is  the  conjunction  of  all  formulas  pi  A  (pi  <->  pi  o  pt)  for 
each  pi  appearing  in  the  formulae  v*  or  w*. 

We  will  show  that  given  a  particular  infinite  dimensional  projective  space 
(with  every  line  containing  at  least  four  points)  V,  then  the  word  problem 
is  valid  in  the  lattice  of  linear  subspaces  of  V  if  and  only  if  its  translation  is 
provable  in  L,  for  any  logic  L  intermediate  between  KR  and  the  logic  of  the 
frame  T{V). 
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If  the  translation  of  the  word  problem  is  valid  in  L,  then  it  holds  in  the 
frame  TCP).  Consider  the  word  problem.  If  it  were  invalid,  then  there  would 
be  linear  subspaces  xi  ,X2, . . .  in  the  space  V  such  that  each  Vi  =  wt  would  be 
true  while  v  ^  w.  Construct  a  model  on  the  frame  T[V)  as  follows.  Let  the 
extension  of  pt  be  the  space  xt  together  with  the  point  0.  It  is  then  simple  to 
show  that  0  Ih  B,  as  each  pt  is  a  positive  idempotent.  In  addition,  0  Ih  t,  and 
0  Ih  vj;  <-»  w\,  for  the  extension  of  each  vj;  and  will  be  the  spaces  picked 
out  by  vt  and  wt  (both  with  the  obligatory  0  added).  However,  we  would  have 
0  \f  vt  <->  w',  since  the  extensions  of  v*  and  w1  were  picked  out  to  differ.  This 
would  amount  to  a  counterexample  to  the  translation  of  the  word  problem, 
which  we  said  was  valid.  As  a  result,  the  word  problem  is  valid  in  the  space  V. 
The  converse  reasoning  is  straightforward.  Deciding  the  logic  would  give  us 
a  decision  for  the  word  problem.  □ 

Unfortunately,  these  techniques  do  not  work  for  systems  weaker  than  KR. 
The  proof  that  positive  idempotents  are  modular  uses  essentially  the  special 
properties  of  KR.  Not  every  positive  idempotent  in  R  is  modular.  Nonethe- 
less, the  techniques  of  the  proof  can  be  extended  to  apply  to  a  much  wider 
range  of  systems.  You  do  not  need  to  restrict  your  attention  to  modular  lat- 
tices to  construct  an  undecidable  word  problem.  But  to  do  that,  you  need  to 
examine  Lipshitz  and  Hutchinson's  proof  more  carefully.  In  the  rest  of  this 
section,  I  will  hint  at  the  structure  of  Urquhart's  undecidability  proof  for  R 
and  other  logics.  For  detail,  the  reader  is  urged  to  consult  Urquhart's  original 
paper  [270]  or  my  retelling  of  the  proof  [234,  Chapter  15] 

Lipshitz  and  Hutchinson  proved  that  the  word  problem  for  modular  lat- 
tices was  undecidable  by  embedding  into  that  problem  the  already  known 
undecidable  word  problem  for  semigroups.  It  is  enough  to  show  that  a  struc- 
ture can  define  a  free  associative  binary  operation,  for  then  you  will  have  the 
tools  for  representing  arbitrary  semigroup  problems.  Urquhart  showed  that 
this  could  be  done  without  resorting  to  the  full  power  of  a  modular  lattice. 

It  suffices  to  have  an  0 -structure,  and  a  modular  4 -frame  defined  within 
that  0-structure.  An  0-structure  is  a  set  equipped  with  the  following  struc- 
ture 

o  It  has  a  semilattice  join  operator  n,  defining  an  order  <; 
o  It  has  a  commutative  and  associative  binary  operator  +; 

ox<y       x  +  z  <y  +  z; 
o  0  +  x  =  x; 

o  y>0       xn(x  +  y)=x; 

A4-frame  in  a  0-structure  is  a  set  {qi  ,  a2,  a3,  a4}  U  {c^  :  i  ^  j,  i,  j  =  1 , . . .  ,4} 
such  that 

o  The  cus  are  independent.  If  G,  H  C  {<n  , . . . ,  a4}  then  (IG)  n  (IH)  =  I(G  n 
H)  (where  10  =  0) 

o  If  G  C  {a-| , . . . ,  04}  then  IG  is  modular 

o  at  +  at  =  at 

O  Cij  =  Cji 

o  at  +  dj  =  at  +  Cik;  Cy  n  ctj  =  0,  if  i  ^  ) 

o  (ai  +  ak)  n  (Cij  +  Cjk)  =  Cik  for  distinct  i,j,k 
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Given  the  4-frame,  we  can  define  a  semigroup  structure.  For  each  distinct  i,  j, 
we  define  the  set  Ly  to  be  {x  :  x  +  aj  =  at  +  aj  and  x  n  a.j  =  0}.  Then  if  b  e 
and  d  e  Ljk  where  i,j,k  are  distinct,  we  set  b  ®  d  =  (b  +  d)  n  (at  +  ak).  It 
follows  (through  some  manipulation)  that  b  ®  de  Uk.  Then,  we  can  define 
a  semigroup  operation  '.'  on  Lu  by: 

x.y  =  (x®  c23)  ®  (c3i  ®  y) 

It  is  quite  an  involved  operation  to  show  that  this  is  associative.  Further- 
more, in  certain  circumstances,  the  operation  is  freely  associative.  Given  a 
countably  infinite-dimensional  vector  space  V,  its  lattice  of  subspaces  is  a  0- 
structure,  and  it  is  possible  to  define  a  modular  4-frame  in  this  lattice  of  sub- 
spaces,  such  that  any  countable  semigroup  is  isomorphic  to  a  subsemigroup 
of  Li  2  under  the  defined  associative  operation. 

The  rest  of  the  work  of  the  undecidability  proof  involves  showing  that 
this  construction  can  be  modelled  in  a  logic.  Perhaps  surprisingly,  it  can 
all  be  done  in  a  weak  logic  like  TW[A,  V,  — >,  T,  _L].  We  can  do  without  nega- 
tion by  defining  it  implicationally  as  usual:  Pick  a  distinguished  propositional 
atom  f,  and  by  defining  —  A  to  be  A  — >  f ,  t  to  be  — f,  and  A :  B  to  be  —(A  — > 

— B).  A  is  a  regular  proposition  iff  A  <-»  A  is  provable.  The  regular 

propositions  form  an  0-structure,  under  the  assumption  of  the  formula  G  = 

{R(t,f,T,_L),N(t,f,T,_L),-T  <->  _L}.  where  R(  A)  is  Ah  A,  N(A)  is  (t  -> 

A)  — >  A,  and  R(A,  B, . . .)  is  R(A)  A  R(B)  A  •  •  •  and  similarly  for  N.  So,  we  can 
show  that  the  conditions  for  an  0-structure  hold  in  the  regular  propositions, 
assuming  Q  as  a  premise.  To  interpret  the  0-structure  conditions  we  model  n 
by  A,  +  by  :  and  0  by  t.  To  model  a  4-frame  in  the  0-structure,  Define  K(A)  to 
beR(A)  A  (A  A  —A  <->  _L)  A  (A  V  —A  hT)A(A:  -  Ah  -A)  A  (A  <->  A:  A). 
Then  we  can  show  the  following 

K(A),R(B,C),Ch  A  =4>  AA(B:C)h(AAB):C 

Then  the  conditions  for  a  4-frame  go  as  follows:  Choose  distinct  atomic  for- 
mulas Ai , . . . ,  A4  and  Cu, . . . ,  C34  to  match  ai , . . . ,  a.4  and  en, . . . ,  C34.  One 
independence  axiom  is  then 

(At  :A2:A3)A(A2:A3:A4)  <-»  (A2:A3) 

and  one  modularity  condition  is 

K(A,  :A3:A4) 

Let  IT  be  the  conjunction  of  the  statements  expressing  that  the  propositions 
At  and  form  a  4-frame  in  the  0-structure  of  regular  propositions.  In  any  al- 
gebra in  which  Gu  IT  is  true,  the  lattice  of  regular  propositions  is  a  0-structure, 
and  the  denotations  of  the  propositions  At  and  Qj  form  a  4-frame.  Finally, 
when  coding  up  a  semigroup  problem  with  variables  xi  ,x2, . . .  ,xm,  we  will 
need  formulas  doing  duty  for  these  variables:  We  need  a  condition  to  pick 
out  the  fact  that  pi  (standing  for  xO  is  in  Li2.  We  define  L(p)  to  be  (p  :  A2  <-» 
Ai  :  A2)  A  (p  A  A2  <— >  t).  Then  the  semigroup  operation  on  elements  of  L]2 
can  be  defined  in  terms  of  A  and  :  and  the  formulas  At  and  .  We  assume 
that  done,  and  we  will  take  it  that  there  is  an  operation  •  on  formulas  which 
picks  out  the  operation  on  L]2.  Then  we  have  the  following: 
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Fact  57  (Deducibility  from  TW  to  KR  is  undecidable)  For  any  logic  be- 
tween TW[A,  V,  — >,  T,  _L]  and  KR,  the  Hilbert  deducibility  problem  is  undecid- 
able. 

Proof  Take  a  semigroup  problem  which  is  known  to  be  undecidable.  It  may 
be  presented  in  the  following  way 

If  vi  =  wi  . . .  vn  =  wn  then  v  =  w 

where  each  term  vi ,  wt  is  a  term  in  the  language  of  semigroups,  constructed 
out  of  the  variables  xi ,  X2 , . . . ,  xm  for  some  m.  The  translation  of  that  problem 
into  the  language  of  TW[A,  V,  — >,  T,  _L]  is  the  deducibility  problem 

e,n,L(p1,...)pm),v^  <->w},...,v*  hw^v'hw* 

where  each  the  translation  ul  of  each  term  u  is  defined  recursively  by  setting 
to  be  pi,  and  (ui  .U2)1  to  be  uj  •  u\. 

For  any  logic  between  TW  and  KR  the  word  problem  in  semigroups  is  valid 
if  and  only  if  its  translation  is  valid  in  that  logic.  If  the  word  problem  is  valid  in 
the  theory  of  semigroups,  its  translation  must  be  valid,  for  given  the  truth  of 
8  and  FT  and  L(pi , . . .  ,pm),  the  operator  •  is  provably  a  semigroup  operation 
on  the  propositions  in  L12  in  the  algebra  of  the  logic,  and  the  terms  v\  and 
wi  satisfy  the  semigroup  conditions.  As  a  result,  v1  and  w*  pick  out  the  same 
propositions,  and  we  have  a  proof  of  vl  <-»  wl. 

Conversely  if  the  word  problem  is  invalid,  then  it  has  an  interpretation  in 
the  semigroup  S  defined  on  Li 2  in  the  lattice  of  subspaces  of  an  infinite  di- 
mensional vector  space.  The  lattice  of  subspaces  of  this  vector  space  is  the 
O-structure  in  our  countermodel.  Consider  the  argument  for  KR.  There,  the 
subspaces  were  the  positive  idempotents  in  the  frame.  The  other  proposi- 
tions in  the  frame  were  arbitrary  subsets  of  points.  Something  similar  can 
work  here.  On  the  vector  space,  consider  the  subsets  of  points  which  are 
closed  under  multiplication  (that  is,  if  x  e  a,  so  is  kx,  where  k  is  taken  from 
the  field  of  the  vector  space).  This  is  a  De  Morgan  algebra,  defining  conjunc- 
tion and  disjunction  by  means  of  intersection  and  union  as  is  usual.  Negation 
is  modelled  by  set  difference.  The  fusion  a  o  |3  of  two  sets  of  points  is  the  set 
{x  +  y  :  x  e  a  andy  e  |3}.  It  is  not  too  difficult  to  show  that  this  is  commu- 
tative and  associative,  and  square  increasing,  when  the  vector  space  is  in  a 
field  of  characteristic  other  than  2,  since  if  x  e  a  then  x  =  \x  +  \x  e  a  o  a. 
Then  a  — >  (3  is  simply  —  (a  o  — 13).  This  is  an  algebraic  model  for  KR,  and  that 
the  regular  propositions  in  this  model  are  exactly  the  subspaces  of  the  vector 
space.  It  follows  that  our  counterexample  in  the  O-structure  is  a  counterex- 
ample in  a  model  of  KR  to  the  translation  of  the  word  problem.  As  a  result, 
the  translation  is  not  provable  in  KR  or  in  any  weaker  logic.  □ 
This  result  applies  to  systems  between  TW  and  KR,  and  it  shows  that  the  de- 
ducibility problem  is  undecidable  for  any  of  these  systems.  In  the  presence  of 
the  modus  ponens  axiom  AA(A^B)At-iB,  this  immediately  yields  the 
undecidability  of  the  theoremhood  problem,  as  the  deducibility  problem  can 
be  rewritten  as  a  single  formula. 

(O  A  TT  A  L(pi , . . . , Pm)  A  (v!h  w} )  A  •  •  •  A  (v*  <->  w*  )  A  t)  ->  (v1  <->  w1 ) 

As  a  result,  the  theoremhood  problem  for  logics  between  T  and  KR  is  unde- 
cidable. In  particular,  R,  E  and  T  are  all  undecidable. 
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The  restriction  to  TW  is  necessary  in  the  theorem.  Without  the  prefixing 
and  suffixing  axioms,  you  cannot  show  that  the  lattice  of  regular  propositions 
is  closed  under  the  'fusion-like'  connective  ' : '. 

Before  moving  on  to  our  next  section,  let  us  mention  that  these  geometric 
methods  have  been  useful  not  only  in  proving  the  undecidability  of  logics, 
but  also  in  showing  that  interpolation  fails  in  R  and  related  logics  [273] . 

3.3.4   Phase  Spaces 

Not  all  substructural  logics  are  distributive,  and  not  all  point  models  validate 
distribution.  In  this  section,  we  will  look  at  phase  spaces  for  linear  logic  as 
an  example  of  a  frame  invalidating  distribution.  Before  launching  into  the 
definition  (due  to  Girard  [1 17])  I  will  set  the  scene  with  some  historical  prece- 
dents. 

An  important  idea  germane  to  the  representation  of  non- distributive  lat- 
tices is  that  of  a  Dedekind-MacNeille  closure  [67,  125,  130, 159,  264]. 

Example  58  (Dedekind-MacNeille  Frames)  Consider  a  poset  with  order 
C.  Define  'y  C  a  for  a  set  a  to  mean  y  C  x  for  each  x  e  a.  Then  the  closure 
fa  of  a  set  a  of  points  can  be  given  as  follows: 

Ta  =  {z  :  Vy(y  C  a  ->  y  C  z)} 

Consider  the  closure  operation  on  the  class  of  all  theories  of  some  logic.  If  cx 
is  a  set  of  theories,  then  suppose  y  C  a  —  that  is,  y  C  x  for  each  x  e  a.  This  is 
equivalent  to  saying  that  y  C  f]  a:  y  is  no  bigger  than  the  intersection  of  the 
set  of  the  theories  in  a,  which  is  itself  a  theory.  So,  if  y  C  z,  then  we  must  have 
f|«Cz  too.  If  x  e  Fa,  then  anything  true  in  all  of  a  must  also  be  true  in  x. 
So  in  these  frames,  to  model  disjunction  we  require  x  lh  A  V  B  if  and  only  if 
xer([A]u[B]). 

Sambin  and  others  have  used  the  notion  of  a  "pretopology"  (in  our  language, 
a  set  with  a  closure  operator)  not  only  as  a  model  of  substructural  logics  but 
also  as  a  constructive  generalisation  of  a  topological  space  [131,  244,  245, 
246].  Dosen  [68,  69],  Ono  and  Komori  [205],  and  Ono  [206]  have  also  used 
given  semantics  involving  a  closure  operation 

This  is  not  the  only  way  to  avoid  distribution.  In  a  model  without  a  notion 
of  inclusion,  we  can  get  by  with  a  negation  to  define  a  closure  operator: 

Example  59  (Goldblatt  Frames)  Consider  orthologic:  an  ortho-negation 
combined  lattice  logic.  Here  a  frame  will  most  likely  appeal  to  a  two-place 
compatibility  relation  C  to  deal  with  negation.  The  compatibility  relation  is 

reflexive  (so  A  A -A  h  _L)  and  symmetric  (so  A  I  A).  Robert  Goldblatt  [122] 

showed  (in  1974)  how  to  deal  with  disjunction  by  considering  a  simple  com- 
patibility frame  (P,  C),  where  P  is  a  set  of  points  (unordered  by  any  inclusion 
relation)  and  C  is  a  symmetric,  reflexive  compatibility  relation  on  P.95  Con- 
junction and  negation  are  modelled  in  the  standard  way: 
o  xlh  AABiffxIh  AandxIhB 

95J.  L.  Bell  presents  an  interesting  philosophical  analysis  of  Goldblatt's  semantics,  in  which  C 
is  understood  as  proximity  [24] . 
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o  x  II-  ~A  iff  for  each  y  where  xCy ,  y  \f  A 
However,  as  it  stands,  this  semantics  does  not  validate  — A  h  A.  To  add  an  ex- 
tra condition  on  C  to  validate  double  negation  elimination  would  result  in  C 
being  the  identity  relation  and  the  logic  would  collapse  into  classical  proposi- 
tional  logic.  Goldblatt's  insight  was  to  instead  restrict  the  evaluation  of  propo- 
sitions on  the  frame  to  those  propositions  for  which  — A  h  A  is  valid.  In  the 
process,  you  reject  distribution. 

Given  a  set  a  C  P,  let  a~  =  {y  :  Vx(x  e  a  — >  ~yCx)},  or  equivalently, 
{y  :  Vx(yCx  — >  x  ^  a)}.  Therefore,  for  any  evaluation,  [A]~  =  J~A].  A  set 
a  c  Pis  said  to  be  C -closed  if  and  only  if  a  =  a  .  The  C-closed  sets  will 
model  our  propositions.  Since  C  is  symmetric,  a  C  a~~. 

A  disjunction  is  true  not  only  at  the  points  at  which  either  disjunct  is  true 
but  also  at  the  closure  of  that  set  of  points.  Here,  however,  it  is  C-closure  at 
work. 

o  xlhAVBiffxe  ([A]U[B])~~ 

Girard's  phase  spaces  (1987)  are  a  generalisation  of  Goldblatt's  compatibility 
frames  (discovered  independently  of  Goldblatt's  work,  despite  being  10  years 
later). 

Example  60  (Phase  Spaces)  A  phase  space  is  a  quadruple  (P,  • ,  1 , 0)  in  which 
(P,  •,  1)  is  a  commutative  monoid,  and  in  which  0  is  a  distinguished  subset 
of  P.  The  elements  of  P  are  phases,  and  0  is  the  set  of  orthogonal  phases  of 
P.96  In  a  phase  space,  the  binary  operator  •  is  used  for  the  ternary  relation  for 
implication.  Here,  Pocyz  if  and  only  if  x  •  y  =z.  For  any  subset  G  C  P,  the  dual 
G~  of  G  is  defined  as  follows: 

G~  ={x  e  P  :  for  ally  e  G(x-y  €  0)} 

In  other  words,  G~  is  the  set  of  all  objects  which  send  each  element  of  G  (by 
the  monoid  operation)  to  0.  For  any  set  G  of  phases,  G~~  is  the  closure  of  G. 
It  is  not  too  hard  to  verify  that  this  is  indeed  a  closure  operation,  by  showing 
the  following: 

o  G  C  G~~. 

o  G —  =  G~. 

o  If  G  c  HthenH~  C  G~. 

o  G  =  G~~  iff  G  =  H~  for  some  H  C  P. 
The  closed  sets  are  called  facts.  The  set  of  facts  can  be  equipped  with  a  natural 
monoid  operation,  (G  •  H)~~,  where  G  •  H  is  defined  in  the  obvious  way  as 
{x  •  y  :  x  e  G  and  y  e  H}.  This  operation  is  residuated  by  the  operation  — > 
defined  by  setting  G  — >  H  =  {x  :  Vy  e  G(xy  e  H)},  which  can  be  shown  to 
equal  (GH~)~. 

For  negation,  we  define  xCy  to  hold  if  and  only  if  x  •  y  ^  0.  C  is  symmetric, 
given  the  commutativity  composition,  and  the  negation  of  a  fact  G  is  G~.  The 
negation  of  a  fact  is  itself  a  fact. 

It  follows  that  this  is  a  model  for  linear  logic  without  exponentials.  R  satis- 
fies the  conditions  for  C  and  B,  as  composition  is  associative  and  commuta- 
tive. The  set  1  =  {1 }~~  is  the  identity  (both  left  and  right)  for  fusion. 

96In  the  linear  logic  literature,  '_L '  is  used  instead  of  '0'  for  the  set  of  orthogonal  phases.  We  use 
±  for  the  bottom  element  of  a  lattice,  so  we  will  use  0  for  the  set  of  orthogonal  phases. 
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Phase  spaces  are  a  particular  kind  of  closure  frame.  They  are  special  in  a  num- 
ber of  ways.  Not  only  is  the  closure  operation  defined  by  negation,  and  not 
only  are  the  structural  rules  B  and  C  satisfied,  but  the  accessibility  relation  un- 
derlying the  frame  is  functional.  Nevertheless,  phase  spaces  are  still  a  faithful 
model  for  linear  logic.  We  have  the  following  theorem. 

Fact  61  (Soundness  and  Completeness  in  Phase  Spaces)  X  I-  A  is  prov- 
able in  linear  logic  if  and  only  ifX  h  A  holds  in  all  phase  spaces. 

Proof  The  soundness  result  is  straightforward  as  usual.  For  completeness, 
we  construct  the  canonical  phase  space  out  of  formulas.  The  operator  •  on 
this  frame  is  fusion.  If  you  wish  to  think  of  a  ternary  relation,  think  of  RABC 
iff  A  =  B  o  C.  Then  for  0,  we  have  {A  :  0  I — A}.  The  false  elements  are  the 
set  of  all  formulae  whose  negations  are  provable,  as  you  would  expect.  This 
is  the  correct  choice,  as  G~  =  {A  :  VB  e  G(B  h  ~A)},  and  so,  G~~  =  {A  :  VB  e 
G~(B  I — A)}  =  {A  :  VB(VC  e  G  where  C  I — B)B  h  ~A}.  Verifying  the  details 
is  no  more  difficult  in  this  case  than  in  Urquhart's  operational  models  for  the 
conjunction/implication  fragment  of  R.  □ 

The  definition  of  a  phase  space  gives  us  a  nice  result.  It  motivates  an  em- 
bedding of  the  whole  of  multiplicative  additive  linear  logic  into  its  (— >,  A,t) 
fragment.  You  choose  f  to  be  some  arbitrary  proposition,  a  translation  as  fol- 
lows (where  we  set  ~A  =  A  — »  f). 


Fact  62  (Embedding  using  A,  — >  and  t)  A  h  B  holds  in  multiplicative,  ad- 
ditive linear  logic  in  if  and  only  if  A1  \-  B*  in  the  (— > ,  A,  t)  fragment. 

Proof  First,  if  A*  h  B*  is  provable  then  A*  h  B*  is  provable  in  linear  logic, 
and  in  particular,  it  is  provable  when  we  choose  ~t  for  f.  In  this  case,  A*  is 
equivalent  to  A  in  linear  logic,  and  therefore,  A  h  B  is  provable. 

Conversely,  if  A1  h  B*  does  not  hold  in  the  (— >,  A,  t)  fragment  then  in  the 
canonical  model  (constructed  simply  out  of  theories)  we  have  a  counterex- 
ample to  A'  h  B*.  Construct  a  phase  space  out  of  this  model.  The  phases  are 
the  theories  in  the  canonical  model.  The  monoid  operation  is  theory  fusion, 
and  the  set  0  is  {x  :  f  e  x}.  It  is  straightforward  to  check  that  any  set  of  the 
form  [ — A]  in  the  original  canonical  model  is  a  fact  in  the  phase  space  we 
are  constructing.  Construct  an  interpretation  of  the  language  of  linear  logic 
by  setting  [A]  in  the  phase  space  to  be  [A1]  in  the  canonical  model.  As  the 
definition  of  the  translation 1  mimics  the  evaluation  clauses  in  a  phase  space, 
this  is  an  acceptable  phase  space  evaluation,  and  it  is  one  which  invalidates 
A  h  B,  so  this  consecution  fails  in  linear  logic.  □ 


(AAB)' 
(A  VB)* 
(AoB)1 
(A  — >  B)1 


HA'AB*) 
-(-A1  A-B1) 
~(At  — >  -B1) 
A1  ->  B1 


V 


t 
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Note  that  this  construction  works  in  logics  other  than  linear  logic.  For  exam- 
ple, it  will  work  to  embed  the  whole  of  R  without  distribution  into  R[— >,  A,  t], 
for  if  the  original  model  satisfies  W,  so  will  the  phase  model  for  R  without 
distribution. 

We  will  end  this  section  by  sketching  how  to  cope  with  non-normal  modal 
operators,  such  as  !  and  ?  of  linear  logic.  The  difficulty  with  operators  like 
these  is  the  way  the  distribution  properties  of  normal  operators  fail.  We  do 
nothave!AA!B  F-  !(AAB).  So,  we  cannot  use  standard  accessibility  relations. 
However,  something  is  possible. 

Definition  63  (Topolinear  Spaces)  A  phase  space  with  a  set  $  of  facts  sat- 
isfying the  following  conditions: 

o  IfX  c  ythenf|X  e  £. 

o  If  F,  G  e  3  then  F  +  G  e  £. 

o  If F  e  £thenF  +  F  =  F. 

o  ns=o. 

is  called  a  topolinear  space.  G  is  a  closed  fact  iff  G  e  3,  and  G  is  a  open  fact  iff 

G~  e 

Now,  given  any  fact  G,  the  consideration  of  G,  ?G,  is 

?G  =  p|{F  :  G  C  F  and  F  e 

It  is  simply  the  smallest  element  of  #  containing  G.  Its  dual,  the  affirmation 
of  G,  !G  is 

!G  =  (|J{H  :  H  C  G  and  FT  e 
These  are  duals,  as  you  can  readily  check. 

Lemma  64  (Duality  of  !  and  ?)  For  any  fact  G,  !(G~)  =  (?G)~,  and  dually, 
?(G~)  =  (!G)~.  □ 

This  definition  gives  us  a  semantics  for  the  exponentials.  The  semantics  does 
as  we  would  expect:  by  construction  G  C  ?G,  for  any  fact  G,  so  by  duality, 
!G  C  G.  Furthermore,  ?G  is  itself  a  closed  fact,  so  ?G  =  ??G,  and  dually,  !G  = 
!!G.  Similarly,  all  of  the  closed  facts  are  fixed  points  for  fission,  ?G  +  ?G  =  ?G, 
and  by  duality,  !G  o  !G  =  !G.  Finally,  0  C  ?G  by  construction,  so  by  duality 
!GCt,  and  by  the  behaviour  of  t,  G  o  t  =  t  gives  F  o  !G  C  F. 

Each  of  these  simple  verifications  shows  that  the  construction  of  !  and  ? 
satisfies  the  rules  for  the  exponentials  in  linear  logic.  This  gives  us  the  first 
part  of  the  following  fact. 

Fact  65  (Soundness  and  Completeness  in  Topolinear  Spaces)  X  F-  A  is 
provable  in  LL  if  and  only  ifX  F-  A  holds  in  all  topolinear  spaces. 

Proof  As  we  have  seen,  the  rules  for  the  exponentials  hold  in  topolinear 
spaces.  For  the  converse,  we  must  verify  that  the  canonical  topolinear  space 
satisfies  the  conditions  required  for  a  topolinear  space.  So  how  should  we 
construct  the  canonical  topolinear  space? 

We  will  use  the  canonical  phase  space  we  have  seen  to  construct  a  set  of 
closed  facts.  Obviously,  each  {A  :  ?B  F-  A}  ought  to  be  a  closed  fact  for  any 
choice  of  B.  This  cannot  be  the  whole  thing,  as  the  intersection  of  a  class 
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of  closed  facts  is  not  necessarily  a  set  of  the  form  {A  :  ?B  h  A}.  So  we  add 
these  intersections.  For  any  class  of  formulae  Bi,  we  will  let  O^A  :  ?Bi  h  A} 
be  a  closed  fact.  Once  we  do  this,  it  is  straightforward  to  check  that  ?[A]  = 
P){F  :  [A]  C  F  and  FeJ)  for  any  formula  A  in  the  canonical  model  structure. 
The  duality  of  ?  and  !,  together  with  the  duality  of  their  defining  conditions, 
ensures  that  the  result  for  !  holds  too.  □ 

This  kind  of  closure  operation  works  well  to  model  the  exponentials  in  phase 
spaces. 

4   Loose  Ends 

Let  me  end  this  whirlwind  tour  through  the  history  of  substructural  logic  by 
indicating  what  I  take  to  be  some  interesting  directions  for  further  research. 

4.1  Paradox 

Untutored  intuitions  about  collections  might  lead  you  to  believe  that  for  any 
properly,  there  is  a  collection  of  all  and  only  those  things  which  have  that 
properly.  Formally  you  might  try  this: 

a  e  {x  :  4>(x)}  Hh  4>(a) 

A  object  a  is  in  the  collection  {x  :  4>(x)}  of  all  of  the  cps  if  and  only  if  a  has 
properly  4>.  This  is  the  naive  membership  scheme.  Russell  has  shown  that 
from  naive  membership  scheme,  paradox  follows.  Consider  the  Russell  set 
{x  :  x  ^  x}.  As  an  instance  of  the  general  scheme  of  membership,  we  have 
Russell's  paradox: 

{x  :  x  ^  x}  e  {x  :  x  ^  x}  <->  {x  :  x  ^  x}  ^  {x  :  x  ^  x} 

The  Russell  set  is  a  member  of  itself  if  and  only  if  it  is  not  a  member  of  itself. 
In  many  traditional  logics  (classical  or  intuitionistic  propositional  logic,  for 
example)  from  pn-phpA-p,  and  from  this,  anything  at  all. 

The  mainstream  response  to  Russell's  paradox  is  to  calm  our  enthusiasm 
for  the  naive  membership  scheme  and  to  hunt  around  for  weaker  theories  of 
set  membership  which  are  not  so  extravagant.97 

However,  this  is  not  the  only  possibility.  There  is  a  motivation  to  consider 
logics  in  which  we  can  retain  the  naive  membership  scheme.  Clearly,  some- 
thing must  be  done  with  the  logic  of  negation,  as  we  wish  to  retain  propo- 
sitions p  such  that  p  <->  ~p,  without  everything  following  from  this.  There 
are  generally  two  options,  logics  with  "gaps"  or  "gluts,"  corresponding  to  the 
point  in  the  inference  from  p  <->  ~p  to  p  A~p  to  _L  which  is  taken  to  fail.  A  logic 
allows  "gaps"  if  it  the  first  inference  fails,  for  p  would  then  be  "neither  true  nor 
false."  A  logic  allows  "gluts"  if  the  second  inference  fails.  Plenty  of  work  has 
been  done  on  both  options  for  a  number  of  years  [115,  116,216,  226] 

However,  it  is  not  just  the  logic  of  negation  which  must  be  non-classical 
in  order  to  retain  the  naive  membership  scheme.  Curry's  paradox  [108,  194]. 

97There  is  some  interesting  work  in  this  area,  attempting  to  admit  sets  which  are  self- 
membered,  without  paradox  [3,  20]. 
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Curry's  paradox  shows  that  more  must  be  done,  if  the  logic  is  to  contain  im- 
plication. Consider  {x  :  x  e  x  — >  F},  for  some  false  proposition  F. 

{x:x€x->F)€{x:x€x-)F}H({x:x€x->F)€{x:xgx-)F}^F) 

This  paradox  reveals  that  there  is  a  proposition  p  such  that  p  <->  (p  — >  F),  and 
as  the  following  deduction  shows,  it  is  hard  to  avoid  the  inference  to  F: 

php^F  php 
 —  HE) 

p;p  h  F 

 [Wl] 

phF  php-)F  php 

■  ME) 


Ohp^F  p^Fhp  p;p  h  F 

  [Cut]    [Wl] 

hp  phF 

 [Cut] 

h  F 

As  the  choice  of  F  is  arbitrary,  we  must  attempt  to  stop  this  somewhere.  A 
number  of  people  have  taken  the  step  of  contraction  as  the  one  to  blame  [44, 
49,  46,  51,  287].  However,  contraction  is  a  useful  inferential  move.  It  is  re- 
quired in  mathematical  induction.  The  step  to,  say,  F5  from  FO  A  (Vx)(Fx  — > 
Fx  +  1 )  requires  the  use  of  the  premise  no  less  than  six  times.  Doing  without 
contraction  seems  a  little  like  cutting  off  one's  nose  to  spite  one's  face.  Can 
better  be  done  here? 


4.2   Relevant  Predication 

Dunn's  Relevant  Predication  program  is  an  interesting  application  of  relevant 
logic  to  the  clarification  of  philosophical  issues  [82,  83,  84,  90,  89,  146].  A 
theory  of  relevant  implication  is  used  to  attempt  to  mark  out  the  distinction 
between  genuine  properties —  say,  my  height,  which  is  a  genuine  properly  of 
me  —  and  fake  properties  —  say,  my  height,  as  a  fake  property  of  you.  I  am 
indeed  such  that  I  am  under  1 .8  metres  tall,  and  you  are  such  that  I  am  under 
1 .8  metres  tall.  But  in  the  first  case  I  have  described  how  I  am,  and  in  the 
second,  I  haven't  described  any  genuine  property  of  you. 

Classical  logic  is  not  good  at  marking  out  such  a  distinction,  for  if  Hx 
stands  for  'x  is  under  1 .8  metres  tall',  and  g  stands  for  Greg,  and  h  stands  for 
you,  then  Hx  is  true  of  x  iff  it  is  under  1 .8  metres  tall,  and  (Hg  A  x  =  x)  V  Hg 
is  true  of  something  iff  I  am  under  1 .8  metres  tall.  Why  is  one  a  'real'  property 
and  the  other  not?  If  we  can  reason  using  relevant  implication,  we  can  make 
the  following  distinction:  It  is  true  that  if  x  is  Greg  then  x  is  under  1 .8  metres. 
However,  it  is  not  true  that  if  x  is  you  then  Greg  is  under  1 .8  metres.  At  least, 
it  is  plausible  that  this  conditional  fail,  when  read  as  a  relevant  conditional. 
This  can  be  cashed  out  as  follows. 

Definition  66  (Relevant  Predication)  F  is  a  relevant  property  of  a  (writ- 
ten (pxFx)a)  if  and  only  if  (Vx)  (x  =  a  — >  Fx). 

If  F  is  a  relevant  property  of  a  then  Fa  holds  (quite  clearly)  and  if  F  and  G  are 
relevant  properties  of  a  then  so  is  their  conjunction,  and  the  disjunction  of 
any  relevant  properly  with  anything  at  all  is  still  a  relevant  property. 
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Relevant  logics  excel  at  telling  you  what  follows  from  what  as  a  matter  of 
logic  —  this  gives  us  an  interesting  picture  of  the  logical  structure  of  relevant 
predication.  However,  that  is  only  half  the  story.  Applying  the  semantics  of 
relevant  logics  ought  to  give  us  insight  into  what  it  is  to  that  a  relevant  impli- 
cation is  true.  That  task  is  as  yet,  undone. 

4.2. 1    Monism  and  Pluralism 

One  debate  in  philosophical  logic  has  been  inspired  by  work  in  relevance 
and  substructural  logic,  and  we  have  already  seen  a  hint  of  it  in  the  discus- 
sion of  disjunctive  syllogism  in  Section  2.4.  This  is  a  debate  between  plural- 
ists  [22,  23]  and  monists  [218,  225]  with  respect  to  logical  consequence.  Is 
there  one  relation  of  deductive  logical  consequence  (relative,  say,  to  a  partic- 
ular choice  of  language,  if  this  is  a  concern),  or  are  there  more  than  one?  To 
make  the  discussion  particular,  given  a  particular  instance  of  the  inference  of 
disjunctive  syllogism 

AVB,-AhB 

should  the  reasoner  accept  the  inference  as  valid,  reject  it  as  invalid,  or  is 
there  more  to  be  said?  In  an  interpretation  which  gives  a  counterexample  to 
this  inference,  we  may  have  a  "point"  x  such  that  x  Ih  A,  x  II — A  and  x  \f  B. 
What  are  we  to  say  about  this? 

Monists  will  say  that  if  the  choice  of  interpretations  is  correct,  then  this 
provides  a  counterexample  to  the  inference.  If  the  choice  of  interpretations  is 
not  a  good  one  (if  the  interpretations  are  a  model  of  a  logic  but  not  of  the  One 
True  Logic)  then  the  argument  may  well  still  be  valid.  For  example,  Priest  [218] 
argues  that  for  an  argument  to  be  valid,  it  must  be  that  in  every  circumstance 
in  which  the  premises  are  true  so  is  the  conclusion,  and  the  One  True  Logic 
is  one  which  is  sound  and  complete  for  the  intended  interpretation  on  the 
actual  class  of  circumstances.  Any  logical  consequence  relation  other  than 
this  either  undergenerates  by  adding  extra  circumstances  (which  are  alleged 
counterexamples  to  really  valid  arguments)  or  overgenerates  by  missing  some 
out  (which  are  counterexamples  to  invalid  arguments  missed  out  by  the  logic 
which  is  too  strong). 

Pluralists  about  logical  consequence,  on  the  other  hand  will  say  that  a 
logic  (and  its  attendant  interpretations)  may  give  us  some  information  about 
the  inference,  but  that  this  may  not  be  the  whole  story  about  its  validity  or 
otherwise.  For  example,  a  pluralist  may  agree  that  there  are  indeed  circum- 
stances in  which  the  premises  of  a  disjunctive  syllogism  are  true  and  the  con- 
clusion untrue.  However,  this  choice  of  circumstances  may  include  special 
circumstances  not  always  considered:  it  includes  impossible  circumstances, 
as  one  would  expect,  if  we  are  taking  relevance  seriously.  It  is  natural  too,  to 
only  consider  possible  circumstances,  and  if  these  are  the  only  circumstances 
to  consider,  then  disjunctive  syllogism  ought  be  considered  valid  in  this  new, 
restricted  sense.  It  is  a  lesson  of  relevant  logic  and  its  semantics  that  these  are 
choices  which  can  be  made.  For  a  monist,  there  is  one  definitive  best  answer 
to  this  choice.  For  a  pluralist,  both  sides  may  have  competing  merits. 

Pluralism  extends  beyond  our  interpretation  of  the  semantics  into  our 
interpretation  of  proof  theory  too.  Substructural  logics  have  shown  us  that 
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there  is  remarkable  robustness  in  the  interpretation  of  a  conditional  by  means 
of  the  residuation  clause: 

X,  A  h  B 
XhA-sB 

However,  the  introduction  and  elimination  rules  for  a  conditional  laid  down 
by  this  clause,  does  not  determine  the  meaning  of  the  conditional.98  These 
rules  only  pick  out  a  fixed  interpretation  in  combination  with  some  account 
of  the  behaviour  of  the  structural  feature  of  the  comma. 

At  the  very  least,  relevant  and  substructural  logics  have  provided  so  many 
new  tools  for  understanding  logical  consequence  that  they  have  put  the  issue 
of  pluralism  on  the  agenda.  Clarifying  these  options  will  deepen  our  under- 
standing of  logical  consequence. 
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